OM protein - protein search, using sw model

Run on:         January 9, 2004, 12:21:36 ; Search time 37.3949 Seconds
                (without alignments)
                1795.466 Million cell updates/sec

Title:          US-09-674-379A-14
Perfect score:  2407
Sequence:       1 QCTNGFDLDRQSGQCLDIDE...INFRGSSVIRLRIYVSQYPF 423

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1107863 seqs, 158726573 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
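
The throughput figure in the header can be cross-checked with a little arithmetic. The sketch below assumes that one "cell update" is one comparison of a query residue with a database residue, so the rate is the query length times the residues searched, divided by the search time. That definition is an assumption, not something the report states, and the variable names are illustrative.

```python
# Values copied from the report header.
query_length = 423            # residues in the query ("Sequence: 1 ... 423")
db_residues = 158_726_573     # "Searched: 1107863 seqs, 158726573 residues"
search_seconds = 37.3949      # "Search time 37.3949 Seconds"

# Assumption: one cell update = one (query residue, database residue) pair.
cells = query_length * db_residues
million_cell_updates_per_sec = cells / search_seconds / 1e6

print(f"{million_cell_updates_per_sec:.3f} Million cell updates/sec")
```

Under that assumption the result agrees with the reported 1795.466 to within about 0.01, which is what one would expect if the printed search time is rounded.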
embl/AA1980.DAT
embl/AA1981.DAT
embl/AA1983.DAT
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
Human extracellula
1276.5
53.
443
20
AAY16587
Extracellular prot
35
AAB33418
Human PRO226 prote
443
AAY84707
A human p53 mutant
AAY55850
Human S1-5 ECMP-li
Human PRO226 polyp
Human PRO226 polyp
443
Human PRO polypept
Human secreted/tra
443
ALIGNMENTS



RESULT 1
AAW79739

ID AAW79739 standard; Protein; 448 AA.
XX

AC AAW79739;
XX

DT 25-JAN-1999 (first entry)
XX

DE Human EEGF protein.
XX

KW Extracellular/epidermal growth factor-like protein; EEGF; human; liver;

KW vascular smooth muscle cell proliferation; neurology; pathology; AIDS;

KW dementia; ocular; disorder; cornea; inflammation; tumour cell; kidney;

KW wound; epithelium; surgery; skin graft; ulcer; diabetic; trophic;

KW Marfan syndrome; treatment; hair follicle; embryogenesis; neoplasma;

KW epidermal cell; cancer; psoriasis; detection.

XX

OS Homo sapiens.
XX

FH Key Location/Qualifiers

FT Peptide 1..25

FT

FT Protein 26..448

FT /label= EEGF

FT /note= "extracellular/epidermal growth factor-like

FT protein"

FT Region

FT /label= EGF-1

FT Region 154..190

FT /label= EGF-2

FT Region 191..230

FT /label= EGF-3

FT Region 231..271

FT /label= EGF-4

FT Region 272..314

FT /label= EGF-5

XX

PN WO9846746-A1.
XX

PD 22-OCT-1998.
XX

PF 11-APR-1997; 97WO-US06020.
XX

PR 11-APR-1997; 97WO-US06020.
XX

PA (HUMA-) HUMAN GENOME SCI INC.
XX

PI Li H, Olsen HS;
XX

DR WPI; 1998-568728/48.

DR N-PSDB; AAV62432.
XX

PT New isolated extracellular/epidermal growth factor - used for

PT regulating vascular smooth muscle cell proliferation, e.g. for

PT enhancing neurological functions or treating neoplasia and other

PT disorders.

XX

PS Claim 10a; Fig 1A-D; 62pp; English.
XX 

CC This sequence represents a novel human extracellular/epidermal growth

CC factor-like protein, EEGF. This protein can be used to regulate 

CC vascular smooth muscle cell proliferation and for restoration or 

CC enhancement of neurological functions diminished as a result or other 

CC damaging pathologies such as AIDS dementia. The protein can also be used 

CC to treat senile dementia, ocular disorders such as corneal inflammation, 

CC for targeting tumour cells, for treating kidney disorders, for liver 

CC regeneration or treating liver dysfunction, for treating wounds including 

CC all cutaneous wounds, corneal wounds, and injuries to the 

CC epithelial-lined hollow organs of the body or resulting from trauma such 

CC as burns, abrasions and cuts as well as from surgical procedures such as 

CC surgical incisions and skin grafting. The polypeptides can also be used 

CC for treating chronic conditions, such as chronic ulcers, diabetic ulcers, 

CC other non-healing (trophic) conditions, to treat Marfan syndrome, to 

CC promote hair follicular development, to stimulate growth and 

CC differentiation of various epidermal and epithelial cells in vivo and in 

CC vitro and to stimulate embryogenesis. Antagonists to EEGF can be used to

CC treat neoplasia such as cancers or tumours, skin disorders such as 

CC psoriasis or corneal inflammation. The products can also be used for 

CC identifying EEGF receptors, detection, diagnosis and drug screening. 

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2407; DB 19; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
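
The "Perfect score" of 2407 in the header is consistent with the query scoring against itself under BLOSUM62. The sketch below sums the BLOSUM62 diagonal over the 423-residue query as a sanity check on the transcription. It reads query residues 330 and 331 as `VV`, the only reading under which that alignment row holds 60 residues. Interpreting "Perfect score" as the self-score is an assumption; the names `BLOSUM62_DIAGONAL`, `QUERY` and `self_score` are illustrative.

```python
# BLOSUM62 diagonal: the score of each residue aligned with itself.
BLOSUM62_DIAGONAL = {
    "A": 4, "R": 5, "N": 6, "D": 6, "C": 9, "Q": 5, "E": 5, "G": 6,
    "H": 8, "I": 4, "L": 4, "K": 5, "M": 5, "F": 6, "P": 7, "S": 4,
    "T": 5, "W": 11, "Y": 7, "V": 4,
}

# Query rows as they appear in the alignment (Qy 1..423), spaces removed.
QUERY = (
    "QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS"
    "TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI"
    "NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC"
    "QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ"
    "PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL"
    "RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS"
    "GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ"
    "YPF"
)


def self_score(seq):
    """Sum the BLOSUM62 self-match score over every residue of seq."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in seq)


print(len(QUERY), self_score(QUERY))
```

An ungapped, mismatch-free alignment never touches the gap penalties (Gapop 10.0, Gapext 0.5), which is why the diagonal alone is enough here.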



RESULT 2 
AAW95709 

ID AAW95709 standard; Protein; 448 AA. 
XX 

AC AAW95709; 
XX 

DT 21-JUN-1999 (first entry) 
XX 

DE Homo sapiens fetal kidney clone AK647 secreted protein. 
XX 

KW Secreted protein; fetal kidney. 
XX 

OS Homo sapiens. 
XX 

PN WO9900405-A1. 
XX 

PD 07-JAN-1999. 
XX 

PF 29-JUN-1998; 98WO-US13530.
XX

PR 30-JUN-1997; 97US-0885610.
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Agostino MJ, Evans C, Jacobs K, Lavallie ER, Mccoy JM; 

PI Merberg D, Racie LA, Treacy M; 

XX 

DR WPI; 1999-095671/08. 

DR N-PSDB; AAX07567. 
XX 

PT New polynucleotides encoding secreted human proteins - are derived 

PT from foetal kidney or adult retina cDNA libraries, used as, e.g. 

PT potential vaccines 
XX 

PS Claim 11; Pages 52-54; 76pp; English. 
XX 

CC The sequence is that of a secreted protein from a human fetal 

CC kidney clone AK296. Such a sequence is predicted to have biological 

CC activities which would make them suitable for treating, preventing or 

CC ameliorating medical conditions in humans and animals, although no 

CC supporting data is given. Suggested activities include nutritional 

CC activity, cytokine and cell proliferation/differentiation activity, 

CC immune stimulating (e.g. as vaccines) or suppressing activity, 

CC haematopoiesis regulating activity, tissue growth activity, 

CC activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic

CC and thrombolytic activity, receptor/ligand activity, anti-inflammatory

CC activity, cadherin/tumour invasion suppressor activity, and tumour

CC inhibition activity. It is also stated to be useful for gene

CC therapy.

XX 

SQ Sequence 448 AA; 



Query Match 100.0%; Score 2407; DB 20; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448





RESULT 3 
AAW94281 

ID AAW94281 standard; Protein; 448 AA. 
XX 

AC AAW94281; 
XX 

DT 07-MAY-1999 (first entry) 
XX 

DE Human extracellular matrix protein (ECMP)-1.
XX 

KW Extracellular matrix protein; ECMP-1; diagnosis; treatment; cancer; 

KW immune disorder; human. 

XX 

OS Homo sapiens.
XX

PN WO9900410-A2.
XX

PD 07-JAN-1999.
XX

PF 23-JUN-1998; 98WO-US13012.
XX

PR 27-JUN-1997; 97US-0884072.
XX

PA (INCY-) INCYTE PHARM INC.
XX

PI Bandman O, Corley NC, Guegler KJ;
XX 

DR WPI; 1999-095674/08. 
DR N-PSDB; AAX05359. 
XX 

PT New polynucleotide encoding extracellular matrix protein, ECMP-1 - 
PT useful in the diagnosis, prevention and treatment of immune 
PT disorders and cancer 
XX 

PS Claim 1; Fig 1A-G; 79pp; English. 
XX 

CC This represents a human extracellular matrix protein (ECMP)-1. Host

CC cells containing a vector comprising the ECMP-1 nucleic acid are used

CC for the recombinant production of the protein. ECMP-1 and its

CC (ant)agonists are useful in the diagnosis, prevention, and treatment

CC of cancer and immune disorders. 

XX 

SQ Sequence 448 AA;

Query Match 100.0%; Score 2407; DB 20; Length 448;

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
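
The "Query Match" percentage on each result's statistics line appears to be the hit's score expressed as a fraction of the header's perfect score (2407). The sketch below pulls the `Score` field out of a statistics line and recomputes that percentage. The formula is an inference from the numbers in this report rather than a documented definition, and `query_match_percent` is a hypothetical helper name.

```python
import re

PERFECT_SCORE = 2407  # "Perfect score" from the report header


def query_match_percent(stats_line, perfect_score=PERFECT_SCORE):
    """Read the Score field of a statistics line and return it as a
    percentage of the perfect score (assumed definition of Query Match)."""
    found = re.search(r"Score\s+([0-9.]+);", stats_line)
    if found is None:
        raise ValueError("no Score field found")
    return 100.0 * float(found.group(1)) / perfect_score


line = "Query Match 100.0%; Score 2407; DB 20; Length 448;"
print(round(query_match_percent(line), 1))
```

The same arithmetic applied to a score of 1276.5 gives roughly 53.0%, which would fit the "1276.5" and "53." fragments that survive from the summary table, though the pairing of those two fragments is itself a guess.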



RESULT 4 
AAY57058 

ID AAY57058 standard; Protein; 448 AA. 
XX 

AC AAY57058; 
XX 

DT 21-FEB-2000 (first entry) 
XX 

DE Amino acid sequence of the human secreted protein AK647. 
XX 

KW AK647; aortic tissue development; smooth muscle cell modulator; SCID; 

KW nutritional supplement; vasculogenesis; embryonic development; infection;

KW cytokine activity; cell proliferation; cell differentiation; detect; HIV; 

KW immune deficiency; haematopoiesis regulation; tissue regrowth; diagnose; 

KW wound healing; restenosis; atherosclerosis; drug screen. 

XX 

OS Homo sapiens.
XX

PN WO9960125-A2.
XX

PD 25-NOV-1999.
XX

PF 18-MAY-1999; 99WO-US10931.
XX

PR 19-MAY-1998; 98US-0081002.

PR 21-MAY-1998; 98US-0083002.
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, McCoy JM, Racie L, LaVallie E, Treacy M, Evans C; 

PI Agostino M, Lu Z, Merberg D; 

XX 

DR WPI; 2000-053298/04. 

DR N-PSDB; AAZ39892. 
XX 

PT Proteins, and their encoding polynucleotides, used for treating e.g. 

PT smooth muscle cell growth, vasculogenesis, restenosis or 

PT atherosclerosis 
XX 

PS Claim 4; Page 46-47; 49pp; English. 
XX 

CC This is the amino acid sequence of the human secreted protein AK647. The 

CC polynucleotide sequence was obtained from a human foetal kidney cDNA 

CC library. AK647 homologues in chicks and rodents are involved in aortic 

CC tissue development. The spatial and temporal distribution of AK647 

CC indicated that it acts as a modulator of smooth muscle cells in

CC vasculogenesis during embryonic development. The primary structure of 

CC AK647 consists of multiple EGF domains. The AK647 protein can be used as 

CC a nutritional source or supplement. The protein shows both inhibitory and 

CC inducing, cytokine, cell proliferation and cell differentiation activity. 

CC The protein may also be used in the treatment of immune deficiencies and 

CC disorders, including severe combined immunodeficiency (SCID), HIV and

CC other viral, bacterial and fungal infections. Regulation of immune 



CC responses may also be carried out by the AK647 protein. Other uses of the 

CC protein include a role in the regulation of haematopoiesis and in the 

CC treatment of myeloid and lymphoid cell deficiencies. Uses in bone, 

CC cartilage, tendon, ligament and nerve tissue regrowth are also possible, 

CC as well as for wound healing and in the treatment of ulcers and burns. 

CC The polynucleotides and proteins can be used for preventing, treating or 

CC ameliorating smooth muscle cell growth, vasculogenesis, restenosis,

CC atherosclerosis, blood vessel remodelling and degeneration. The proteins

CC may also have activin/inhibin, chemotactic/chemokinetic, haemostatic and

CC thrombolytic, receptor/ligand, anti-inflammatory, cadherin/tumour

CC invasion suppressor, and tumour inhibition activity. AK647 specific 

CC antibodies can be used for promoting smooth muscle cell growth or 

CC vasculogenesis. The proteins and polynucleotides can also be used for 

CC detection, diagnosis and drug screening. 

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2407; DB 21; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
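
Each database record in this report is a series of lines that begin with a two-letter tag (ID, AC, DT, DE, KW, PN, PR and so on), with `XX` lines as separators. A minimal sketch of reading one record into a dictionary follows. It assumes one tag per line followed by a space and the value; `parse_record` is a hypothetical helper, and the sample is a short excerpt of the record above with stray spaces removed.

```python
from collections import defaultdict


def parse_record(text):
    """Group the lines of one database record by their two-letter tag."""
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line == "XX":   # blank lines and XX separators carry no data
            continue
        tag, _, value = line.partition(" ")
        if len(tag) == 2 and tag.isupper():
            fields[tag].append(value.strip())
    return dict(fields)


sample = """\
ID AAY57058 standard; Protein; 448 AA.
XX
AC AAY57058;
XX
DT 21-FEB-2000 (first entry)
XX
PR 19-MAY-1998; 98US-0081002.
PR 21-MAY-1998; 98US-0083002.
"""

record = parse_record(sample)
print(record["AC"], record["PR"])
```

Keeping every tag as a list means repeated lines such as PR, KW or CC stay in order and can be joined later if a single string is wanted.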



RESULT 5 
AAY54989

ID AAY54989 standard; Protein; 448 AA.
XX

AC AAY54989;
XX 

DT 15-FEB-2000 (first entry) 
XX 

DE Full length human A55 protein sequence. 
XX 

KW A55 protein; human; smooth muscle proliferation; tissue generation; 

KW vascular smooth muscle cell; arteriosclerosis; tissue reparation; myoma; 

KW vascular endothelial thickening; haematopoietic cell-regulator; cytokine; 

KW percutaneous transluminal coronary angioplasty; blood coagulation; PTCA; 

KW actin; inhibin; chemotaxis; thrombosis; cadherin; therapy; 

KW tumour metastasis inhibitor. 

XX 

OS Homo sapiens. 
XX 

PN WO9955864-A1.
XX

PD 04-NOV-1999.
XX

PF 28-APR-1999; 99WO-JP02284.
XX

PR 28-APR-1998; 98JP-0119731.
XX

PA (ONOY ) ONO PHARM CO LTD.
XX 

PI Honjo T, Tashiro K, Nakamura T; 
XX 

DR WPI; 2000-038647/03. 

DR N-PSDB; AAZ40027. 
XX 

PT Novel human polypeptides for treatment of, e.g. arteriosclerosis and 

PT myoma - 

XX 

PS Claim 1; Page 76-80; 87pp; Japanese.
XX 

CC This sequence is the human A55 protein of the invention. The protein 

CC can be used for the treatment of diseases due to abnormal proliferation 

CC of smooth muscle. The polypeptides can be used according to their inhibition

CC of the proliferation of vascular smooth muscle cells, particularly in

CC treating arteriosclerosis or re-narrowing by vascular endothelial

CC thickening after percutaneous transluminal coronary angioplasty (PTCA),

CC or myoma, haematopoietic cell-regulatory activity, cytokine activity,

CC tissue generation/reparation activity, actin/inhibin activity, taxis

CC and chemotaxis activity, blood coagulation/thrombotic activity,

CC receptor/ligand activity, cadherin/tumour metastasis inhibiting activity;

CC tumour inhibition, and as a nutrient. 

XX 

SQ Sequence 448 AA;



Query Match 100.0%; Score 2407; DB 21; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 6
AAM93573

ID AAM93573 standard; Protein; 448 AA.
XX

AC AAM93573;
XX

DT 06-NOV-2001 (first entry)
XX

DE Human polypeptide, SEQ ID NO: 3357.
XX

KW Human; full length cDNA; cDNA synthesis; oligo-capping.
XX

OS Homo sapiens.
XX

PN EP1130094-A2.
XX

PD 05-SEP-2001.
XX

PF 07-JUL-2000; 2000EP-0114089.
XX

PR 08-JUL-1999; 99JP-0194486.

PR 11-JAN-2000; 2000JP-0118774.

PR 02-MAY-2000; 2000JP-0183765.
XX

PA (HELI-) HELIX RES INST.
XX

PI Ota T, Nishikawa T, Isogai T, Hayashi K, Ishii S, Kawai Y;

PI Wakamatsu A, Sugiyama T, Nagai K, Kojima S, Otsuki T, Koga H;
XX 

DR WPI; 2001-524255/58. 

DR N-PSDB; AAK94505. 
XX 

PT 830 Primers useful for synthesizing full length cDNA clones and their 

PT use in genetic manipulation - 

XX 

PS Claim 8; SEQ ID NO 3357; 1380pp + sequence listing; English. 
XX 

CC The invention relates to primers for synthesising full length cDNA 

CC clones. 830 cDNA molecules encoding a human protein have been 

CC isolated and nucleotide sequences of 5'- and 3'-ends of the cDNA

CC molecules have been determined. Primers for synthesising the full length 

CC cDNA are useful for clarifying the function of the protein encoded by 

CC the cDNA. The full length clones were obtained by construction of full 

CC length enriched cDNA libraries that were synthesised by the oligo-capping 

CC method. The primers enable the production of the full length cDNA easily 

CC without any special methods. The present sequence is a polypeptide 

CC encoded by a full length human cDNA of the invention. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in CD-ROM format directly from EPO. 

XX 

SQ Sequence 448 AA; 

Query Match 100.0%; Score 2407; DB 22; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
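
The middle line of each alignment triple marks positions where the query (Qy) and database (Db) residues are identical. The sketch below rebuilds such a line from two equal-length rows and counts the identities, which is how a "Matches 423" figure can be checked against the rows. It marks identities only; how the original tool marks conservative substitutions is not shown in this report, so that case is left blank here. `match_line` is a hypothetical helper.

```python
def match_line(query_row, db_row):
    """Return '|' under identical residues and ' ' elsewhere (identities only)."""
    if len(query_row) != len(db_row):
        raise ValueError("rows must be the same length")
    return "".join("|" if q == d else " " for q, d in zip(query_row, db_row))


qy = "YPF"  # Qy 421..423
db = "YPF"  # Db 446..448
line = match_line(qy, db)
print(line, line.count("|"))
```

Summing `line.count("|")` over all eight rows of a result should reproduce its "Matches" value when the alignment has no gaps.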



RESULT 7 
AAU75494 

ID AAU75494 standard; Protein; 448 AA. 
XX 

AC AAU75494;
XX 

DT 23-APR-2002 (first entry) 
XX 

DE Human extracellular protein-like/EGF-like protein, EEGF.
XX 

KW Human; extracellular protein-like protein; EGF-like; 

KW protein; epidermal growth factor; EEGF; ATCC 97285; gene therapy; 

KW vascular smooth muscle cell proliferation; Marfan syndrome; 

KW wound healing; neurological trauma; acquired immunodeficiency syndrome; 

KW AIDS-related dementia; ocular disorder; kidney disorder; liver disorder;

KW hair follicle growth promotion; burn; ulcer; corneal incision; 

KW corneal inflammation; neoplasm; psoriasis. 

XX 

OS Homo sapiens.



XX 

FH Key Location/Qualifiers

FT Peptide 1..25

FT /label= Signal_peptide

FT Protein 26..448

FT /label= Mature_EEGF

FT /note= "This region is specifically claimed in claim 10"

FT Domain 112..153

FT /label= EGF_1_domain

FT /note= "This domain is specifically claimed in claim 10"

FT Domain 154..190

FT /label= EGF_2_domain

FT /note= "This domain is specifically claimed in claim 10"

FT Domain 191..230

FT /label= EGF_3_domain

FT /note= "This domain is specifically claimed in claim 10"

FT Domain 231..271

FT /label= EGF_4_domain

FT /note= "This domain is specifically claimed in claim 10"

FT Domain 272..314

FT /label= EGF_5_domain

FT /note= "This domain is specifically claimed in claim 10"

XX 



PN US2001051358-A1. 
XX 

PD 13-DEC-2001. 
XX 

PF 25-MAR-1999; 99US-0275805.
XX

PR 11-APR-1997; 97US-0839525.

PR 10-APR-1996; 96WO-US05247.
XX 

PA (OLSE/) OLSEN H S. 



PA (LIHH/) LI H. 
XX 



PI Olsen HS, Li H; 
XX 

DR WPI; 2002-121417/16. 

DR N-PSDB; ABK13627. 
XX 

PT New nucleic acid encoding human extracellular/epidermal growth factor, 

PT useful for diagnosis and treatment of e.g. Marfan syndrome and wounds, 

PT also related polypeptides 
XX 

PS Claim 10; Fig 1; 22pp; English. 
XX 

CC The invention relates to a novel polynucleotide which is at 

CC least 95% identical with a sequence (ATCC 97285) encoding mature human 

CC extracellular protein-like/epidermal growth factor (EGF)-like protein, 

CC EEGF. Also included are the EEGF EGF domains, a vector containing 

CC the polynucleotide, a host cell containing the vector, anti-EEGF 

CC antibodies and antagonists of EEGF. The polynucleotide is used for 

CC recombinant production of EEGF, in gene therapy, as hybridisation probes, 

CC as antisense antagonists and for chromosome identification. The protein 

CC is used to treat patients who require EEGF, to identify specific 

CC antagonists, used to treat conditions that require inhibition of EEGF 

CC (e.g. vascular smooth muscle cell proliferation, Marfan syndrome, wound 

CC healing, neurological trauma, acquired immunodeficiency syndrome 

CC (AIDS)-related dementia, ocular disorders, kidney disorders, liver

CC disorders, hair follicle growth promotion, burns, ulcers, corneal 

CC incisions, corneal inflammation, neoplasms and psoriasis), to raise 

CC specific antibodies and to characterise receptors. The present 

CC sequence represents EEGF. 

XX 

SQ Sequence 448 AA;

Query Match 100.0%; Score 2407; DB 23; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.2e-159; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
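
Because this alignment is ungapped (Indels 0; Gaps 0) and pairs Qy 1 with Db 26, database coordinates can be converted to query coordinates by subtracting a fixed offset of 25, which is the length of the signal peptide (1..25) in the feature table of this record. The sketch below applies that offset to the five EGF domain ranges listed in the record. `db_to_query` is a hypothetical helper, and the mapping holds only for gap-free alignments like this one.

```python
DB_START, QUERY_START = 26, 1        # first aligned pair: Qy 1 <-> Db 26
OFFSET = DB_START - QUERY_START      # 25 residues, the signal peptide 1..25
QUERY_LENGTH = 423


def db_to_query(db_pos):
    """Convert a database position to the aligned query position (ungapped only)."""
    query_pos = db_pos - OFFSET
    if not 1 <= query_pos <= QUERY_LENGTH:
        raise ValueError(f"Db position {db_pos} lies outside the aligned region")
    return query_pos


# Domain ranges as given in the FT lines of this record (Db coordinates).
egf_domains = {
    "EGF_1_domain": (112, 153),
    "EGF_2_domain": (154, 190),
    "EGF_3_domain": (191, 230),
    "EGF_4_domain": (231, 271),
    "EGF_5_domain": (272, 314),
}

for name, (start, end) in egf_domains.items():
    print(name, db_to_query(start), db_to_query(end))
```

Positions 1 to 25 of the database entry raise an error by design, since the signal peptide has no counterpart in the 423-residue query.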

RESULT 8 
AAY08063 

ID AAY08063 standard; Protein; 448 AA. 
XX 

AC AAY08063; 
XX 

DT 11-SEP-2000 (first entry)
XX

DE Human EGF-like homologue protein (PRO217) encoded by DNA32279 cDNA.
XX 

KW Inflammatory cell infiltration; immune response; T cell proliferation; 

KW anti-inflammatory; anti-autoimmune; anti-diabetic; spondyloarthropathy;

KW T cell-mediated disease; spondyloarthropathy; sclerosis; renal disease;

KW inflammatory myopathy; hemolytic anemia; thrombocytopenia; thyroiditis; 

KW diabetes mellitus; demyelinating polyneuropathy; Guillain-Barre syndrome; 

KW multiple sclerosis; polyneuropathy; hepatitis; cirrhosis; enteropathy; 

KW sclerosing cholangitis; inflammatory bowel disease; Whipple's disease; 

KW skin disease; dermatitis; psoriasis; asthma; allergic rhinitis; tumor; 

KW food hypersensitivity; urticaria; eosinophilic pneumonia; transplant; 

KW idiopathic pulmonary fibrosis; graft rejection; PRO245; human; PRO217;

KW EGF-like. 
XX 

OS Homo sapiens. 
XX 

PN WO9914241-A2.
XX
PD 25-MAR-1999.
XX
PF 17-SEP-1998; 98WO-US19437.
XX
PR 17-SEP-1997; 97US-0059119.
PR 18-SEP-1997; 97US-0059263.
PR 28-OCT-1997; 97US-0063550.
PR 12-NOV-1997; 97US-0065186.
PR 21-NOV-1997; 97US-0066364.
PR 24-NOV-1997; 97US-0066770.
PR 04-JUN-1998; 98US-0088026.
XX
PA (GETH) GENENTECH INC.
XX 

PI Fong S, Goddard A, Gurney AL, Tumas D, Wood WI;
XX 

DR WPI; 1999-229499/19. 

DR N-PSDB; AAX37670.
XX

PT Composition containing novel polypeptide PRO245, its agonist or

PT antagonist - 

XX 

PS Example 1; Fig 6A; 177pp; English.
XX 

CC This invention describes a novel composition containing (apart from a 

CC carrier or excipient), a novel PRO245 polypeptide (I), its agonist or

CC antagonist, or their fragments, for modulating: (i) infiltration of 

CC inflammatory cells into tissue; (ii) an immune response; or (iii) T cell 

CC proliferation. The composition increases or decreases any of the effects 

CC (i)-(iii). The products of the invention have anti-inflammatory, 

CC anti-autoimmune and anti-diabetic activity. (I), and its (ant)agonists

CC and their fragments, are used to treat immune-related diseases,

CC particularly T cell-mediated diseases. The diseases treated include

CC systemic lupus erythematosus, rheumatoid arthritis, juvenile chronic 

CC arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), 

CC idiopathic inflammatory myopathies (dermatomyositis, polymyositis),

CC Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune 

CC hemolytic anemia (immune pancytopenia, paroxysmal nocturnal 

CC hemoglobinuria), autoimmune thrombocytopenia (idiopathic thrombocytopenic

CC purpura immune-mediated thrombocytopenia), thyroiditis (Grave's disease, 

CC Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic 

CC thyroiditis), diabetes mellitus, immune-mediated renal disease 

CC (glomerulonephritis, tubulointerstitial nephritis), multiple sclerosis,

CC idiopathic demyelinating polyneuropathy, Guillain-Barre syndrome, chronic

CC inflammatory demyelinating polyneuropathy, infectious hepatitis

CC (hepatitis A, B, C, D, E and other non-hepatotropic viruses), autoimmune

CC chronic active hepatitis, primary biliary cirrhosis, granulomatous 

CC hepatitis, and sclerosing cholangitis, inflammatory bowel disease 

CC (ulcerative colitis: Crohn's disease), gluten-sensitive enteropathy, and 

CC Whipple's disease. Autoimmune or immune-mediated skin diseases including 

CC bullous skin diseases, erythema multiforme, contact dermatitis, psoriasis, 

CC asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity, 

CC urticaria, eosinophilic pneumonia, idiopathic pulmonary fibrosis, 

CC hypersensitivity pneumonitis, and transplantation associated diseases 

CC (graft rejection, and graft-versus-host-disease). (I), its (ant)agonists

CC or fragment can also be used as an adjuvant in treatment of tumors. 

CC Antibodies against (I) can also be used for diagnosing such diseases. 

CC This sequence represents a human EGF-like homologue encoded by cDNA clone 

CC DNA32279 which is described in the invention. 

XX 

SQ Sequence 448 AA; 

Query Match 99.8%; Score 2401; DB 20; Length 448; 

Best Local Similarity 99.8%; Pred. No. 3.2e-159; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 9 
AAU29227 
ID AAU29227 standard; Protein; 448 AA.
XX
AC AAU29227;
XX
DT 18-DEC-2001 (first entry)
XX
DE Human PRO polypeptide sequence #204.
XX
KW PRO polypeptide; mammal; tumour; cancer; human; cattle; horse; sheep;
KW dog; cat; pig; goat; rabbit; tumour necrosis factor alpha; TNF-alpha;
KW blood; chondrocyte cell; cell proliferation; cell differentiation; colon;
KW adrenal; lung; breast; prostate; rectum; cervix; liver; genetic disorder.
XX
OS Homo sapiens.
XX
PN WO200168848-A2.
XX
PD 20-SEP-2001.
XX
PF 28-FEB-2001; 2001WO-US06520.
XX
PR 01-MAR-2000; 2000WO-US05601.
PR 02-MAR-2000; 2000WO-US05841.
PR 03-MAR-2000; 2000US-187202P.
PR 06-MAR-2000; 2000US-186968P.
PR 14-MAR-2000; 2000US-189320P.
PR 14-MAR-2000; 2000US-189328P.
PR 15-MAR-2000; 2000WO-US06884.
PR 21-MAR-2000; 2000US-190828P.
PR 21-MAR-2000; 2000US-191007P.
PR 21-MAR-2000; 2000US-191048P.



PR 21-MAR-2000; 2000US-191314P.
PR 28-MAR-2000; 2000US-192655P.
PR 29-MAR-2000; 2000US-193032P.
PR 29-MAR-2000; 2000US-193053P.
PR 30-MAR-2000; 2000WO-US08439.
PR 04-APR-2000; 2000US-194449P.
PR 04-APR-2000; 2000US-194647P.
PR 11-APR-2000; 2000US-195975P.
PR 11-APR-2000; 2000US-196000P.
PR 11-APR-2000; 2000US-196187P.
PR 11-APR-2000; 2000US-196690P.
PR 11-APR-2000; 2000US-196820P.
PR 18-APR-2000; 2000US-198121P.
PR 18-APR-2000; 2000US-198585P.
PR 25-APR-2000; 2000US-199397P.
PR 25-APR-2000; 2000US-199550P.
PR 25-APR-2000; 2000US-199654P.
PR 03-MAY-2000; 2000US-201516P.
PR 17-MAY-2000; 2000WO-US13705.
PR 22-MAY-2000; 2000WO-US14042.
PR 30-MAY-2000; 2000WO-US14941.
PR 02-JUN-2000; 2000WO-US15264.
PR 05-JUN-2000; 2000US-209832P.
PR 28-JUL-2000; 2000WO-US20710.
PR 22-AUG-2000; 2000US-0644848.
PR 24-AUG-2000; 2000WO-US23328.
PR 08-NOV-2000; 2000WO-US30952.
PR 01-DEC-2000; 2000WO-US32678.
PR 20-DEC-2000; 2000WO-US34956.
XX 

PA (GETH) GENENTECH INC.
XX
PI Baker KP, Chen J, Desnoyers L, Goddard A, Godowski PJ, Gurney AL;
PI Pan J, Smith V, Watanabe CK, Wood WI, Zhang Z;
XX
DR WPI; 2001-602746/68.
DR N-PSDB; AAS46128.
XX 

PT Novel nucleic acids encoding PRO polypeptides, used to diagnose the 

PT presence of tumours, such as prostate and breast tumours, in mammals and 

PT to screen for modulators of the compounds - 

XX 

PS Claim 11; Fig 408; 774pp; English.
XX

CC Sequences AAU29024-AAU29328 represent PRO polypeptides of the invention.

CC The PRO polypeptides and their associated nucleic acids can be used to 

CC detect the presence of a tumour in a mammal by comparing the level of 

CC expression of a PRO polypeptide in a test sample of cells from the animal 

CC and a control sample of normal cells, whereby a higher level of 

CC expression in the test sample indicates the presence of a tumour in the 

CC mammal. Mammals include dogs, cats, cattle, horses, sheep, pigs, goats 

CC and rabbits but are preferably human. The polypeptides can be used to 

CC stimulate tumour necrosis factor (TNF) alpha release from human blood, 

CC when contacted with it. A specific polypeptide can be used to stimulate 

CC the proliferation or differentiation of chondrocyte cells. The PRO 

CC proteins can be used to determine the presence of tumours and also 

CC susceptibility to tumour development, particularly adrenal, lung, colon, 



CC breast, prostate, rectal, cervical, or liver tumours, in mammalian 

CC subjects. The oligonucleotide probes specific for the PRO nucleic acids 

CC can be used for genetic analysis of individuals with genetic disorders. 
XX 

SQ Sequence 448 AA; 

Query Match 99.8%; Score 2401; DB 22; Length 448; 

Best Local Similarity 99.8%; Pred. No. 3.2e-159; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 10 
AAB31183 

ID AAB31183 standard; Protein; 448 AA. 
XX 

AC AAB31183;

XX

DT 20-APR-2001 (first entry) 
XX 

DE Amino acid sequence of human polypeptide PRO210. 
XX 

KW Human; secreted protein; transmembrane protein; PRO196; PRO444; PRO183;
KW PRO185; PRO210; PRO215; PRO217; PRO242; PRO288; PRO365; PRO1361; PRO1308;
KW PRO1183; PRO1272; PRO1419; PRO4999; PRO7170; PRO248; PRO353; PRO1318;
KW PRO1600; PRO9940; PRO533; PRO301; PRO187; PRO337; PRO1411; PRO4356;
KW PRO246; PRO265; PRO941; PRO10096; PRO6003; PRO6004; PRO350; PRO2630;
KW PRO6309; cell death; genetic disorder; transgenic animal; gene therapy.



XX 

OS Homo sapiens.
XX 

FH Key Location/Qualifiers 

FT Peptide 1..25
FT /note= "signal peptide"
FT Modified-site 21..27
FT /note= "N-myristoylation site"
FT Binding-site 54..57
FT /note= "cell attachment site"
FT Modified-site 64..70
FT /note= "N-myristoylation site"
FT Modified-site 144..156
FT /note= "aspartic acid and asparagine hydroxylation site"
FT Modified-site 149..155
FT /note= "N-myristoylation site"
FT Modified-site 186..192
FT /note= "N-myristoylation site"
FT Modified-site 226..232
FT /note= "N-myristoylation site"
FT Modified-site 242..248
FT /note= "N-myristoylation site"
FT Modified-site 267..273
FT /note= "N-myristoylation site"
FT Modified-site 283..287
FT /note= "N-glycosylation site"
FT Modified-site 296..300
FT /note= "N-glycosylation site"
FT Modified-site 310..316
FT /note= "N-myristoylation site"
XX 



PN WO200077037-A2.
XX 



PD 21-DEC-2000.
XX












PF 
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-MAY- 


2000; 2000WO-US14042.
XX
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1999 


; 99WO 
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1999 


99WO 
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15 


-SEP-1999; 99WO-US21090.
PR 15-SEP-1999; 99WO-US21547.
PR 30-NOV-1999; 99WO-US28313.
PR 01-DEC-1999; 99WO-US28301.
PR 02-DEC-1999; 99WO-US28565.
PR 07-DEC-1999; 99US-0169495.
PR 05-JAN-2000; 2000WO-US00219.
PR 18-FEB-2000; 2000WO-US04341.
PR 18-FEB-2000; 2000WO-US04342.
PR 22-FEB-2000; 2000WO-US04414.
PR 01-MAR-2000; 2000WO-US05601.
PR 02-MAR-2000; 2000WO-US05841.



PR 20-MAR-2000; 2000WO-US07377.
PR 30-MAR-2000; 2000WO-US08439.
PR 15-MAY-2000; 2000WO-US13358.
PR 17-MAY-2000; 2000WO-US13705.
XX

PA (GETH) GENENTECH INC.
XX 

PI Ashkenazi AJ, Baker KP, Botstein DA, Desnoyers L, Eaton DL; 

PI Ferrara N, Fong S, Gao W, Gerber H, Gerritsen ME, Goddard A; 

PI Godowski PJ, Gurney AL, Kljavin IJ, Mather JP, Napier MA, Pan J; 

PI Paoni NF, Roy MA, Stewart TA, Tumas D, Watanabe CK, Williams PM; 

PI Wood WI, Zhang Z; 

XX 

DR WPI; 2001-050091/06. 

DR N-PSDB; AAC86968. 
XX 

PT Isolated nucleic acid molecule encoding a PRO polypeptide which is a 

PT transmembrane polypeptide is useful for gene therapy and identification 

PT of related polypeptides - 
XX 

PS Claim 12; Fig 10; 244pp; English.
XX 

CC The present sequence represents a human secreted and transmembrane 

CC polypeptide. The specification describes human polypeptides, designated 

CC PRO196, PRO444, PRO183, PRO185, PRO210, PRO215, PRO217, PRO242, PRO288,

CC PRO365, PRO1361, PRO1308, PRO1183, PRO1272, PRO1419, PRO4999, PRO7170,

CC PRO248, PRO353, PRO1318, PRO1600, PRO9940, PRO533, PRO301, PRO187,

CC PRO337, PRO1411, PRO4356, PRO246, PRO265, PRO941, PRO10096, PRO6003,

CC PRO6004, PRO350, PRO2630 and PRO6309. The biological activity of cells 

CC can be modulated with agents that bind to these polypeptides, resulting 

CC in the death of the cells. The polynucleotides encoding these 

CC polypeptides are useful in the recombinant production of the 

CC polypeptides, as a hybridisation probe to screen libraries to isolate 

CC homologous sequences, or to map the gene. They may also be used for 

CC analysing genetic disorders, and to produce transgenic animals which are 

CC useful for the development and screening of therapeutically useful 

CC reagents. The polynucleotides can also be used in gene therapy e.g. to 

CC replace a defective gene. 

XX 

SQ Sequence 448 AA; 

Query Match 99.8%; Score 2401; DB 22; Length 448; 

Best Local Similarity 99.8%; Pred. No. 3.2e-159; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 11 
ABU71315 

ID ABU71315 standard; Protein; 448 AA. 
XX 

AC ABU71315; 
XX 

DT 10-JUN-2003 (first entry) 
XX 

DE Human PRO210 protein. 
XX 

KW Human; PRO; secreted; transmembrane; cytostatic; TNF-alpha; blood; 

KW tumour necrosis factor alpha release; chondrocyte cell; proliferation; 

KW differentiation; tumour; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN US2003036143-A1. 
XX 
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PA (GETH ) GENENTECH INC. 
XX 

PI Ashkenazi AJ, Baker KP, Botstein DA, Desnoyers L, Eaton DL; 

PI Ferrara N, Fong S, Gao W, Gerber H, Gerritsen ME, Goddard A; 

PI Godowski PJ, Gurney AL, Kljavin IJ, Mather JP, Napier MA, Pan J; 

PI Paoni NF, Roy MA, Stewart TA, Tumas D, Watanabe CK, Williams PM; 

PI Wood WI, Zhang Z; 

XX 

DR WPI; 2003-328482/31. 

DR N-PSDB; ACA60458. 
XX 

PT Novel secreted and transmembrane polypeptide for modulating biological 

PT activity of cell expressing the polypeptide, for identifying agonists 

PT or antagonists of polypeptide, and as molecular weight markers 
XX 

PS Claim 12; Fig 10; 254pp; English.
XX 

CC The invention describes an isolated, secreted and transmembrane 

CC polypeptide (PP), termed PRO PP or fibroblast growth factor receptor PP

CC (I). (I) is useful for detecting PRO533, PRO301, PRO187, PRO337,

CC PRO1411, PRO10096, PRO246, PRO6307, PRO6003, fibroblast growth factor

CC receptor (FGFR)-3, FGFR-4, FGFR-1, FGFR-2, PRO6004, PRO4356, PRO2630,

CC PRO265 or PRO951 polypeptide, and for linking a bioactive molecule to a

CC cell expressing the above polypeptides. The bioactive molecule, a toxin, 

CC radiolabel or an antibody, causes cell death. PRO is useful in assays to 

CC identify other proteins or molecules involved in binding interaction. - 

CC The polynucleotide (II) encoding (I) is useful in chromosome and gene 

CC mapping, in generation of antisense RNA and DNA, for generating 

CC transgenic animals or knockout animals which in turn are useful in the 

CC development and screening of therapeutically useful reagents, to 

CC construct hybridisation probes for mapping the gene which encodes the 

CC PRO and for the genetic analysis of individuals with genetic disorders, 

CC in gene therapy, for chromosome identification and as a chromosome 

CC marker. (I) and (II) are useful for tissue typing. This is the amino 

CC acid sequence of a novel human secreted and transmembrane PRO 

CC polypeptide. 



XX 

SQ Sequence 448 AA;



Query Match 99.8%; Score 2401; DB 24; Length 448; 

Best Local Similarity 99.8%; Pred. No. 3.2e-159; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 13 
ABU65772 

ID ABU65772 standard; Protein; 448 AA. 
XX 

AC ABU65772; 
XX 

DT 19-MAY-2003 (first entry) 
XX 

DE Human secreted/transmembrane protein, SEQ ID 408. 
XX 

KW Human; PRO; secreted protein; transmembrane protein; 

KW cytostatic; antiarthritic; osteopathic; adrenal tumour; lung tumour; 

KW colon tumour; breast tumour; prostate tumour; rectal tumour; 

KW cervical tumour; liver tumour; TNF-alpha release; arthritis; 

KW tumour necrosis factor alpha; chondrocyte cell; bone disorder; 

KW cartilage disorder; sports injury. 

XX 
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Homo sapiens . 
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Score 24 01; DB 24; 
Pred. No. 3.2e-159; 
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Length 448; 



Indels 
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1 


Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423



RESULT 14 
ABU66105 

ID ABU66105 standard; Protein; 448 AA. 
XX 

AC ABU66105; 



XX 

DT 20-MAY-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO210. 
XX 

KW Human; secreted protein; transmembrane protein; cytostatic; 

KW gene therapy; TNF-Agonist-Alpha; chondrocyte stimulator; tumour;

KW adrenal tumour; lung tumour; colon tumour; breast tumour; 

KW prostate tumour; rectal tumour; cervical tumour; liver tumour. 

XX 

OS Homo sapiens. 
XX 

PN US2003036157-A1. 
XX 

PD 20-FEB-2003. 
XX 

PF 02-JUL-2002; 2002US-0188769.
XX 

PR 16-SEP-1998; 98WO-US19330.
PR 07-OCT-1998; 98WO-US21141.
PR 01-DEC-1998; 98WO-US25108.
PR 08-MAR-1999; 99WO-US05028.
PR 14-MAY-1999; 99WO-US10733.
PR 02-JUN-1999; 99WO-US12252.
PR 01-SEP-1999; 99WO-US20111.
PR 15-SEP-1999; 99WO-US21090.
PR 01-DEC-1999; 99WO-US28301.
PR 02-DEC-1999; 99WO-US28551.
PR 30-DEC-1999; 99WO-US31274.
PR 05-JAN-2000; 2000WO-US00219.
PR 18-FEB-2000; 2000WO-US04341.
PR 18-FEB-2000; 2000WO-US04342.
PR 22-FEB-2000; 2000WO-US04414.
PR 24-FEB-2000; 2000WO-US05004.
PR 01-MAR-2000; 2000WO-US05601.
PR 02-MAR-2000; 2000WO-US05841.
PR 15-MAR-2000; 2000WO-US06884.
PR 30-MAR-2000; 2000WO-US08439.
PR 17-MAY-2000; 2000WO-US13705.
PR 22-MAY-2000; 2000WO-US14042.
PR 30-MAY-2000; 2000WO-US14941.
PR 02-JUN-2000; 2000WO-US15264.
PR 28-JUL-2000; 2000WO-US20710.
PR 24-AUG-2000; 2000WO-US23328.
PR 08-NOV-2000; 2000WO-US30952.
PR 01-DEC-2000; 2000WO-US32678.
PR 20-DEC-2000; 2000WO-US34956.
PR 28-FEB-2001; 2001WO-US06520.
PR 01-JUN-2001; 2001WO-US17800.
PR 20-JUN-2001; 2001WO-US19692.
PR 29-JUN-2001; 2001WO-US21066.
PR 09-JUL-2001; 2001WO-US21735.
PR 29-AUG-2001; 2001WO-US27099.
PR 18-SEP-1997; 97US-059263P.
PR 18-SEP-1997; 97US-059266P.
PR 17-OCT-1997; 97US-062250P.
PR 21-OCT-1997; 97US-063486P.



PR 24-OCT-1997; 97US-063120P.
PR 24-OCT-1997; 97US-063121P.
PR 28-OCT-1997; 97US-063540P.
PR 28-OCT-1997; 97US-063541P.
PR 28-OCT-1997; 97US-063544P.
PR 28-OCT-1997; 97US-063564P.
PR 29-OCT-1997; 97US-063734P.
PR 31-OCT-1997; 97US-063870P.
PR 31-OCT-1997; 97US-064103P.
PR 13-NOV-1997; 97US-065311P.
PR 21-NOV-1997; 97US-066120P.
PR 24-NOV-1997; 97US-066466P.
PR 24-NOV-1997; 97US-066772P.
PR 11-DEC-1997; 97US-069335P.
PR 12-DEC-1997; 97US-069425P.
PR 17-DEC-1997; 97US-069870P.
PR 18-DEC-1997; 97US-068017P.
PR 10-MAR-1998; 98US-077450P.
PR 11-MAR-1998; 98US-077632P.
PR 11-MAR-1998; 98US-077649P.
PR 20-MAR-1998; 98US-078886P.
PR 20-MAR-1998; 98US-078939P.
PR 27-MAR-1998; 98US-079664P.
PR 27-MAR-1998; 98US-079786P.
PR 31-MAR-1998; 98US-080107P.
PR 31-MAR-1998; 98US-080194P.
PR 01-APR-1998; 98US-080327P.
PR 01-APR-1998; 98US-080333P.
PR 08-APR-1998; 98US-081049P.
PR 08-APR-1998; 98US-081070P.
PR 09-APR-1998; 98US-081195P.
PR 15-APR-1998; 98US-081838P.
PR 21-APR-1998; 98US-082568P.
PR 21-APR-1998; 98US-082569P.
PR 22-APR-1998; 98US-082704P.
PR 22-APR-1998; 98US-082797P.
PR 28-APR-1998; 98US-083322P.
PR 29-APR-1998; 98US-083495P.
PR 29-APR-1998; 98US-083496P.
PR 29-APR-1998; 98US-083499P.
PR 29-APR-1998; 98US-083559P.
PR 05-MAY-1998; 98US-084366P.
PR 06-MAY-1998; 98US-084414P.
PR 07-MAY-1998; 98US-084639P.
PR 07-MAY-1998; 98US-084640P.
PR 07-MAY-1998; 98US-084643P.
PR 15-MAY-1998; 98US-085579P.
PR 15-MAY-1998; 98US-085580P.
PR 15-MAY-1998; 98US-085582P.
PR 15-MAY-1998; 98US-085700P.
PR 18-MAY-1998; 98US-086023P.
PR 22-MAY-1998; 98US-086392P.
PR 22-MAY-1998; 98US-086486P.
PR 28-MAY-1998; 98US-087098P.
PR 28-MAY-1998; 98US-087208P.
PR 02-JUN-1998; 98US-087609P.
PR 02-JUN-1998; 98US-087759P.



PR 03-JUN-1998; 98US-087827P.
PR 04-JUN-1998; 98US-088025P.
PR 04-JUN-1998; 98US-088028P.
PR 04-JUN-1998; 98US-088029P.
PR 04-JUN-1998; 98US-088033P.
PR 04-JUN-1998; 98US-088326P.
PR 05-JUN-1998; 98US-088167P.
PR 05-JUN-1998; 98US-088202P.
PR 05-JUN-1998; 98US-088212P.
PR 05-JUN-1998; 98US-088217P.
PR 09-JUN-1998; 98US-088655P.
PR 10-JUN-1998; 98US-088722P.
PR 10-JUN-1998; 98US-088738P.
PR 10-JUN-1998; 98US-088740P.
PR 10-JUN-1998; 98US-088811P.
PR 10-JUN-1998; 98US-088824P.
PR 10-JUN-1998; 98US-088825P.
PR 10-JUN-1998; 98US-088826P.
PR 11-JUN-1998; 98US-088861P.
PR 11-JUN-1998; 98US-088863P.
PR 11-JUN-1998; 98US-088876P.
PR 12-JUN-1998; 98US-089090P.
PR 12-JUN-1998; 98US-089105P.
PR 16-JUN-1998; 98US-089512P.
PR 16-JUN-1998; 98US-089514P.
PR 17-JUN-1998; 98US-089538P.
PR 17-JUN-1998; 98US-089598P.
PR 17-JUN-1998; 98US-089653P.
PR 18-JUN-1998; 98US-089908P.
PR 19-JUN-1998; 98US-089952P.
PR 22-JUN-1998; 98US-090246P.
PR 22-JUN-1998; 98US-090252P.
PR 22-JUN-1998; 98US-090254P.
PR 24-JUN-1998; 98US-090429P.
PR 24-JUN-1998; 98US-090435P.
PR 24-JUN-1998; 98US-090444P.
PR 24-JUN-1998; 98US-090461P.
PR 24-JUN-1998; 98US-090535P.
PR 24-JUN-1998; 98US-090540P.
PR 25-JUN-1998; 98US-090676P.
PR 25-JUN-1998; 98US-090678P.
PR 25-JUN-1998; 98US-090688P.
PR 25-JUN-1998; 98US-090690P.
PR 25-JUN-1998; 98US-090694P.
PR 25-JUN-1998; 98US-090695P.
PR 25-JUN-1998; 98US-090696P.
PR 26-JUN-1998; 98US-090862P.
PR 26-JUN-1998; 98US-090863P.
PR 26-JUN-1998; 98US-091010P.
PR 01-JUL-1998; 98US-091359P.
PR 01-JUL-1998; 98US-091544P.
PR 02-JUL-1998; 98US-091478P.
PR 02-JUL-1998; 98US-091486P.
PR 02-JUL-1998; 98US-091626P.
PR 02-JUL-1998; 98US-091628P.
PR 02-JUL-1998; 98US-091632P.
PR 24-JUL-1998; 98US-094006P.



PR 04-AUG-1998; 98US-095282P.
PR 10-AUG-1998; 98US-095998P.
PR 10-AUG-1998; 98US-096012P.
PR 17-AUG-1998; 98US-096757P.
PR 17-AUG-1998; 98US-096766P.
PR 17-AUG-1998; 98US-096867P.
PR 17-AUG-1998; 98US-096891P.
PR 17-AUG-1998; 98US-096897P.
PR 18-AUG-1998; 98US-096949P.
PR 18-AUG-1998; 98US-096959P.
PR 18-AUG-1998; 98US-097022P.
PR 26-AUG-1998; 98US-097952P.
PR 26-AUG-1998; 98US-097954P.
PR 26-AUG-1998; 98US-097955P.
PR 26-AUG-1998; 98US-097971P.
PR 26-AUG-1998; 98US-097974P.
PR 26-AUG-1998; 98US-098014P.
PR 01-SEP-1998; 98US-098716P.
PR 01-SEP-1998; 98US-098723P.
PR 02-SEP-1998; 98US-098803P.
PR 02-SEP-1998; 98US-098821P.
PR 02-SEP-1998; 98US-098843P.
PR 09-SEP-1998; 98US-099602P.
PR 10-SEP-1998; 98US-099741P.
PR 10-SEP-1998; 98US-099754P.
PR 10-SEP-1998; 98US-099763P.

Query Match 99.8%; Score 2401; DB 24; Length 448; 

Best Local Similarity 99.8%; Pred. No. 3.2e-159; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448

RESULT 15 
ABU67141 

ID ABU67141 standard; Protein; 448 AA. 
XX 

AC ABU67141; 
XX 

DT 28-MAY-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO210.
XX 

KW Secreted and transmembrane polypeptide; PRO polypeptide; PRO533;

KW PRO301; PRO187; PRO337; PRO1411; PRO10096; PRO246; PRO6307; PRO6003;

KW PRO6004; PRO4356; PRO2630; PRO265; PRO941; FGFR; bioactive molecule;

KW fibroblast growth factor receptor; cell death; chromosome mapping;

KW gene mapping; transgenic animal; knockout animal; gene therapy;

KW tumour; obesity; diabetes; insulinaemia; vascular permeability; 

KW cardiac insufficiency disorder; immune response; hearing loss; 

KW auditory hair cell regeneration; bone disorder; cartilage disorder; 

KW sports injury; arthritis. 

XX 

OS Homo sapiens . 
XX 

PN US2003032062-A1. 
XX 

PD 13-FEB-2003.
XX 

PF 01-FEB-2002; 2002US-0066273 . 
XX 

PR 14-JUL-1998; 98WO-US14552 . 

PR 10-SEP-1998; 98WO-US18824 . 

PR 14-SEP-1998; 98WO-US19093 . 

PR 16-SEP-1998; 98WO-US19330 . 

PR 17-SEP-1998; 98WO-US19437.

PR 20-NOV-1998; 98WO-US24855.

PR 25-NOV-1998; 98WO-US25190.

PR 01-DEC-1998; 98WO-US25108 . 

PR 08-MAR-1999; 99WO-US05028 . 

PR 02-JUN-1999; 99WO-US12252 . 

PR 01-SEP-1999; 99WO-US20111 . 

PR 08-SEP-1999; 99WO-US20594.

PR 15-SEP-1999; 99WO-US21090.

PR 15-SEP-1999; 99WO-US21547 . 

PR 30-NOV-1999; 99WO-US28313 . 

PR 01-DEC-1999; 99WO-US28301.

PR 02-DEC-1999; 99WO-US28565 . 

PR 20-DEC-1999; 99WO-US30999 . 

PR 05-JAN-2000; 2000WO-US00219 . 

PR 18-FEB-2000; 2000WO-US04341.

PR 18-FEB-2000; 2000WO-US04342.

PR 22-FEB-2000; 2000WO-US04414.



PR 01-MAR-2000; 2000WO-US05601 . 

PR 02-MAR-2000; 2000WO-US05841 . 

PR 09-MAR-2000; 2000WO-US06471 . 

PR 20-MAR-2000; 2000WO-US07377 . 

PR 30-MAR-2000; 2000WO-US08439 . 

PR 15-MAY-2000; 2000WO-US13358 . 

PR 17-MAY-2000; 2000WO-US13705 . 

PR 22-MAY-2000; 2000WO-US14042 . 

PR 30-MAY-2000; 2000WO-US14941 . 

PR 02-JUN-2000; 2000WO-US15264 . 

PR 11-AUG-2000; 2000WO-US22031.

PR 23-AUG-2000; 2000WO-US23522 . 

PR 24-AUG-2000; 2000WO-US23328 . 

PR 01-DEC-2000; 2000WO-US32678 . 

PR 28-FEB-2001; 2001WO-US06520 . 

PR 30-MAY-2001; 2001WO-US17443 . 

PR 01-JUN-2001; 2001WO-US17800 . 

PR 20-JUN-2001; 2001WO-US19692 . 

PR 29-JUN-2001; 2001WO-US21066.

PR 09-JUL-2001; 2001WO-US21735 . 

PR 26-AUG-1997; 97US-056974P.

PR 17-SEP-1997; 97US-059115P. 

PR 18-SEP-1997; 97US-059263P. 

PR 19-SEP-1997; 97US-059588P. 

PR 17-OCT-1997; 97US-062285P.

PR 24-OCT-1997; 97US-062816P . 

PR 24-OCT-1997; 97US-063082P . 

PR 27-OCT-1997; 97US-063329P.

PR 29-OCT-1997; 97US-063733P . 

PR 21-NOV-1997; 97US-066364P . 

PR 25-NOV-1997; 97US-066840P.

PR 16-DEC-1997; 97US-069694P . 

PR 09-FEB-1998; 98US-074086P . 

PR 09-FEB-1998; 98US-074092P . 

PR 25-MAR-1998; 98US-079294P . 

PR 08-APR-1998; 98US-081049P.

PR 10-AUG-1998; 98US-095998P . 

PR 18-AUG-1998; 98US-097000P . 

PR 09-SEP-1998; 98US-099601P.

PR 10-SEP-1998; 98US-099803P.

PR 10-SEP-1998; 98US-099811P . 

PR 10-SEP-1998; 98US-099812P . 

PR 17-SEP-1998; 98US-100858P.

PR 24-SEP-1998; 98US-101922P . 

PR 28-OCT-1998; 98US-106032P . 

PR 20-NOV-1998; 98US-109304P . 

PR 23-MAR-1999; 99US-125778P . 

PR 15-JUN-1999; 99US-139695P . 

PR 20-JUL-1999; 99US-145070P . 

PR 26-JUL-1999; 99US-145698P . 

PR 17-AUG-1999; 99US-149396P . 

PR 07-DEC-1999; 99US-169495P.

PR 15-NOV-2001; 2001US-0002796 . 
XX 

PA (GETH) GENENTECH INC.
XX 

PI Ashkenazi AJ, Baker KP, Botstein DA, Desnoyers L, Eaton DL; 



PI Ferrara N, Fong S, Gao W, Gerber H, Gerritsen ME, Goddard A; 

PI Godowski PJ, Gurney AL, Kljavin IJ, Mather JP, Napier MA, Pan J;

PI Paoni NF, Roy MA, Stewart TA, Tumas D, Watanabe CK, Williams PM; 

PI Wood WI, Zhang Z; 

XX 

DR WPI; 2003-341963/32. 

DR N-PSDB; ACA04448. 
XX 

PT New secreted and transmembrane polypeptide for modulating biological 

PT activity of a cell expressing the polypeptide, identifying agonists or 

PT antagonists of the polypeptide, and as molecular weight markers 
XX 

PS Claim 12; Fig 10; 254pp; English.
XX 

CC The invention describes an isolated, secreted and transmembrane 

CC polypeptide (I), termed PRO polypeptide. (I) Is useful for detecting

CC PRO533, PRO301, PRO187, PRO337, PRO1411, PRO10096, PRO246, PRO6307,

CC PRO6003, PRO6004, PRO4356, PRO2630, PRO265, PRO941, fibroblast growth

CC factor receptor (FGFR)-4, FGFR-3, FGFR-2 or FGFR-1 polypeptide, and for

CC linking a bioactive molecule e.g. toxin, radiolabel or antibody, to a 

CC cell expressing the polypeptides. The bioactive molecule causes cell 

CC death. (II) Is useful as hybridisation probes, in chromosome and gene 

CC mapping, in generation of antisense RNA and DNA, in the preparation of 

CC PRO polypeptide, for generating transgenic animals or knockout animals 

CC which in turn are useful in the development and screening of 

CC therapeutically useful reagents, and for the genetic analysis of 

CC individuals with genetic disorders, in gene therapy, and for chromosome 

CC identification. (I) Or Ab is useful for the preparation of medicament for 

CC treating conditions which are responsive to the PRO polypeptide or 

CC anti-PRO antibody e.g. a tumour. (I) is useful for treating obesity, 

CC diabetes or hypo- or hyper-insulinaemia, and cardiac insufficiency

CC disorders, for inhibiting tumour growth, enhances vascular permeability 

CC and immune response, for inducing regeneration of auditory hair cells and 

CC for treating hearing loss in mammals, and for treating bone and/or 

CC cartilage disorders such as sports injuries and arthritis. This is the 

CC amino acid sequence of a novel human secreted and transmembrane 

CC polypeptide. 

XX 

SQ Sequence 448 AA; 



Query Match 99.8%; Score 2401; DB 24; Length 448;

Best Local Similarity 99.8%; Pred. No. 3.2e-159;

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448
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Job time : 38.3949 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd.



OM protein - protein search, using sw model 

Run on:         January 9, 2004, 12:30:17 ; Search time 15.0551 Seconds
                (without alignments)
                1188.799 Million cell updates/sec

Title:          US-09-674-379A-14
Perfect score:  2407
Sequence:       1 QCTNGFDLDRQSGQCLDIDE......INFRGSSVIRLRIYVSQYPF 423

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       328717 seqs, 42310858 residues

Total number of hits satisfying chosen parameters:      328717

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/l/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: *

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: *

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: *

6 : /cgn2_6/ptodata/l/iaa/backfiles1.pep: *

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
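
The throughput figure in the run header appears to follow from the other header values: query length times database residues gives the number of alignment-matrix cells, and dividing by the search time gives cell updates per second. The sketch below checks that reading. The function name is illustrative, and the formula is inferred from the numbers printed in this report rather than taken from GenCore documentation.

```python
def million_cell_updates_per_sec(query_len, db_residues, search_seconds):
    """Matrix cells filled per second, in millions.

    Assumption: one cell per (query residue, database residue) pair.
    """
    return query_len * db_residues / search_seconds / 1e6


# Values from this run: 423-residue query, 42310858 residues searched, 15.0551 s.
rate = million_cell_updates_per_sec(423, 42310858, 15.0551)
print(f"{rate:.3f} Million cell updates/sec")
```

The same arithmetic applied to the earlier run in this report (158726573 residues, 37.3949 seconds) gives a value close to the 1795.466 printed there, which supports the assumed formula.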
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ALIGNMENTS 



RESULT 1 
US-08-884-072-1 

; Sequence 1, Application US/08884072 
; Patent No. 5872234 
; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Corley, Neil C. 

APPLICANT: Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP : 94304 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/884,072

FILING DATE: Herewith 

CLASSIFICATION: 424 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 

TELEX : 

; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 448 amino acids

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: CORNNOT01 

CLONE: 45517 
US-08-884-072-1 

Query Match 100.0%; Score 2407; DB 2; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.7e-182; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 2 
US-09-212-168-1 

; Sequence 1, Application US/09212168 
; Patent No. 6303765 
; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Corley, Neil C. 

APPLICANT: Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/212,168 

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/884,072 

FILING DATE: 
ATTORNEY/AGENT INFORMATION:

NAME: Billings, Lucy J.

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0333 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 

TELEX : 

INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 448 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: CORNNOT01 

CLONE: 45517 
US-09-212-168-1 



Query Match 100.0%; Score 2407; DB 4; Length 448; 

Best Local Similarity 100.0%; Pred. No. 1.7e-182;

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy  61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy 421 YPF 423
       |||
Db 446 YPF 448



RESULT 3 

US-09-312-283C-186 

Sequence 186, Application US/09312283C 
Patent No. 6573095 
GENERAL INFORMATION: 

APPLICANT: Watson, James D. 

APPLICANT: Strachan, Lorna 

APPLICANT: Sleeman, Matthew 

APPLICANT: Onrust, Rene 

APPLICANT: Murison, James G. 

APPLICANT: Kumble, Krishanand D. 

TITLE OF INVENTION: Compositions Isolated from Skin Cells 
TITLE OF INVENTION: and Methods for Their Use 
FILE REFERENCE: 11000.1011c2

CURRENT APPLICATION NUMBER: US/09/312,283C
CURRENT FILING DATE: 1999-05-14
NUMBER OF SEQ ID NOS: 425
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 186 






LENGTH: 335 
TYPE: PRT 
ORGANISM: Mouse 
US-09-312-283C-186 

Query Match 75.9%; Score 1827; DB 4; Length 335; 

Best Local Similarity 95.2%; Pred. No. 7.8e-137; 

Matches 319; Conservative 7; Mismatches 9; Indels 0; Gaps 0; 

Qy  89 RFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 148
       |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 RFGYQMDEGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 60

Qy 149 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 208
       ||||||||||||||||||||||||||:||||||||||| |||||||||||||||||||||
Db  61 YGYCQQLCANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCD 120

Qy 209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268
       |||||||||:||||||||||||||||||||||||:|||||||||:||:||||||||||||
Db 121 PGYELEEDGIHCSDMDECSFSEFLCQHECVNQPGSYFCSCPPGYVLLEDNRSCQDINECE 180

Qy 269 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 328
       ||||||   |||||||||||||||| |||||| | |||||||||| ||||||||||:|||
Db 181 HRNHTCTPLQTCYNLQGGFKCIDPIVCEEPYLLIGDNRCMCPAENTGCRDQPFTILFRDM 240

Qy 329 DWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 388
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 DWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 300

Qy 389 REIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
       |:|||||||||||||||||||||||||||||||||
Db 301 RDIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 335

RESULT 4 

US-09-188-930-186 

Sequence 186, Application US/09188930A 
Patent No. 6150502 
GENERAL INFORMATION: 
APPLICANT: Watson, James D. 
APPLICANT: Strachan, Lorna 
APPLICANT: Sleeman, Matthew 
APPLICANT: Onrust, Rene
APPLICANT: Murison, James Greg 

TITLE OF INVENTION: Compositions Isolated From Skin Cells 
TITLE OF INVENTION: and Methods For Their Use 
FILE REFERENCE: 11000.1011c1
CURRENT APPLICATION NUMBER: US/09/188,930A
CURRENT FILING DATE: 1998-11-09
NUMBER OF SEQ ID NOS: 348
SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 186
LENGTH: 337
TYPE: PRT
ORGANISM: Rat 
FEATURE:
NAME/KEY: UNSURE
LOCATION: (124)..(124)
NAME/KEY: UNSURE
LOCATION: (135)..(135)
NAME/KEY: UNSURE
LOCATION: (141)..(141)
NAME/KEY: UNSURE
LOCATION: (166)..(166)
NAME/KEY: UNSURE
LOCATION: (167)..(167)
NAME/KEY: UNSURE
LOCATION: (183)..(183)
US-09-188-930-186 




Query Match 74.4%; Score 1791; DB 3; Length 337;

Best Local Similarity 93.7%; Pred. No. 5.4e-134;

Matches 314; Conservative 6; Mismatches 15; Indels 0; Gaps 0;

Qy  89 RFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 148
       |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Db   3 RFGYQMDEGNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQCLDIDECR 62

Qy 149 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 208
       ||||||||||||||||||||||||||:||||||||||| |||||||||||||||||||||
Db  63 YGYCQQLCANVPGSYSCTCNPGFTLNDDGRSCQDVNECETENPCVQTCVNTYGSFICRCD 122

Qy 209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268
       | |||||||:|| ||||| |||||||||||||||:||||||||  ||:||||||||||||
Db 123 PXYELEEDGIHCXDMDECXFSEFLCQHECVNQPGSYFCSCPPGXXLLEDNRSCQDINECE 182

Qy 269 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 328
        |||||   |||||||||||||||| |||||| | |||||||||| ||||||||||:|||
Db 183 XRNHTCTPLQTCYNLQGGFKCIDPIVCEEPYLLIGDNRCMCPAENTGCRDQPFTILFRDM 242

Qy 329 DWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 388
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 243 DWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 302

Qy 389 REIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
       |:|||||||||||||||||||||||||||||||||
Db 303 RDIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 337

RESULT 5 

US-08-833-963C-2 

Sequence 2, Application US/08833963C 
Patent No. 5916769 
GENERAL INFORMATION: 

APPLICANT: Olsen, et al.

TITLE OF INVENTION: Extracellular/Epidermal Growth Factor
TITLE OF INVENTION: HCABA58X
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Ave 
CITY: Rockville 
STATE : MD 
COUNTRY : USA 






ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/833,963C

FILING DATE: 11-APR-1997

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US96/05033 

FILING DATE: 10-APR-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Brookes, A. Anders

REGISTRATION NUMBER: 36,373 

REFERENCE/DOCKET NUMBER: PF258 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 443 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein
US-08-833-963C-2 

Query Match 53.0%; Score 1276.5; DB 2; Length 443; 

Best Local Similarity 52.0%; Pred. No. 3.1e-93; 

Matches 220; Conservative 67; Mismatches 119; Indels 17; Gaps 4; 



Qy 1 QCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

I I I I h:|| Mlllhhl hi Mllhlh I 
Db 38 ECTDGYEWDPDSQHCRDVNECLTI PEACKGEMKCI NHYGGYLCLPRSAAVI NDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI 12 0 

II I II II I Ih I : MINIM I I hi I 

Db 94 EGP-PPPVPPAQHPN PCPPGYEPDDQDSCVDVDECAQALHDCRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi Ml : I III I , MINI : III 

Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240 

1 1 1 1 1 II I I I 1 1 II II Mhlllhl I I 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261 

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

I I : I I I II II I lllhlll I h III I I I : : I : I II llh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYI 320 

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360 

MIMI II Ihll M::| I : I lllllhlhllh Mill III- 
Db 321 QVSENRCLCPASNPLCREQPSSIVHRYMTITSERSVPADVFQIQATSVYPGAYNAFQIRA 380

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420

Db 381 GNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440

Qy 421 YPF 423 

I I 

Db 441 YTF 443 
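
The percentages in each result summary can be reproduced from the other printed fields, which is a useful sanity check when OCR has damaged one of them. The sketch below assumes that Query Match is the score divided by the perfect score from the run header (2407), and that Best Local Similarity is the number of matches divided by the total number of alignment columns. Both formulas are inferred from the figures in this report, and the function names are illustrative.

```python
PERFECT_SCORE = 2407  # "Perfect score" from the run header


def query_match_pct(score, perfect=PERFECT_SCORE):
    """Score as a percentage of the perfect (self-alignment) score."""
    return round(100.0 * score / perfect, 1)


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# RESULT 5: Score 1276.5; Matches 220; Conservative 67; Mismatches 119; Indels 17
print(query_match_pct(1276.5))
print(best_local_similarity_pct(220, 67, 119, 17))
```

For RESULT 5 these reproduce the printed 53.0% and 52.0%.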



RESULT 6 
US-08-980-514-1 

; Sequence 1, Application US/08980514
; Patent No. 6004753 
; GENERAL INFORMATION: 

APPLICANT : Yue, Henry 

APPLICANT: Guegler, Karl J. 

APPLICANT: Shah, Purvi 

TITLE OF INVENTION: HUMAN S1-5 ECMP-LIKE PROT
TITLE OF INVENTION: EIN
NUMBER OF SEQUENCES: 3
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Dr. 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/980,514

FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
FILING DATE:
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0436 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 650-855-0555 

TELEFAX: 650-845-4166 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 443 amino acids 

TYPE: amino acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRSTNOT13 

CLONE: 2786449 
US-08-980-514-1 



Query Match 53.0%; Score 1276.5; DB 3; Length 443;

Best Local Similarity 52.0%; Pred. No. 3.1e-93;

Matches 220; Conservative 67; Mismatches 119; Indels 17; Gaps 4;



Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

W-\ I I I h:|| MMIhhl hi llllhlh | 
Db 38 ECTDGYEWDPDSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPRSAAVINDLHG 93

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI 12 0 

II I II II I Ih h : llllllll I I hi | 

Db 94 EGP-PPPVPPAQHPN PCPPGYEPDDQDSCVDVDECAQALHDCRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi Ml : MMIIIMI IN | hllh I I III I : III 

Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 24 0 

Mill M I I hlhhill MM M llhlllhl Mlh Mh 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261 

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

II : I M II M I I I I I : i I I I h III I i II Mh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYI 32 0 

Qy 301 RI SDNRCMCPAENPGCRDQPFTI LYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KS 360 

Mil || I I | : | | | | | | | : | | : | | | : Mill III- 

Db 321 QVSENRCLCPASNPLCREQPSS I VHRYMTI TSERS VPADVFQI QATSVYPGAYNAFQI RA 380 

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420 

II Ih' I Ih ||: | | | | II M M Ih I ::| 

Db 381 GNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440

Qy 421 YPF 423 

I I 

Db 441 YTF 443 
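
The Matches, Mismatches and Indels counts in a summary can be cross-checked against the printed Qy and Db rows by walking the two aligned strings column by column. The sketch below is a minimal column tally, not the program's own logic: it only separates identical, different and gapped columns. Splitting "different" into conservative and non-conservative substitutions would need the BLOSUM62 table, which is omitted here. The function name and dictionary keys are illustrative.

```python
def tally_columns(qy, db, gap="-"):
    """Count identical, different and gapped columns of two aligned rows."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same length")
    counts = {"identical": 0, "different": 0, "gap": 0}
    for a, b in zip(qy, db):
        if a == gap or b == gap:
            counts["gap"] += 1
        elif a == b:
            counts["identical"] += 1
        else:
            counts["different"] += 1
    return counts


# Last block of RESULT 6: Qy 421 YPF 423 against Db 441 YTF 443.
print(tally_columns("YPF", "YTF"))
# A fragment of the 241-300 block that contains a one-residue gap in Db.
print(tally_columns("LLDDNRSC", "LL-ATRLC"))
```

Summing the tallies over every block of a result gives totals that can be compared with the summary line printed above the alignment.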



RESULT 7 
US-08-884-072-5 

; Sequence 5, Application US/08884072 
; Patent No. 5872234 
; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Corley, Neil C. 

APPLICANT: Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 
COMPUTER READABLE FORM : 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 



CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/884,072
FILING DATE: Herewith
CLASSIFICATION: 424
PRIOR APPLICATION DATA:
APPLICATION NUMBER:
FILING DATE:
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0333 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
TELEX : 

INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 387 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 458228 
US-08-884-072-5 

Query Match 40.0%; Score 963.5; DB 2; Length 387; 

Best Local Similarity 48.3%; Pred. No. 1.4e-68; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy 71 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 130

II 1=1 I : I I I : III hill U I hill I I 

Db 38 ADPQRI PSNP - - SHRI QCAAGYEQSEHNVCQDI DECTAGTHNCRADQVCI NLRGSFACQC 95 

Qy 131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189 

II Ihlllll II I I I llh I hill | : I |:||| 

Db 96 PPGYQKRGEQCVDI DECTI PPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155 

Qy 190 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 249

I I I I I llllhh I I I I I M hill I :|lh:|lhll : I II 
Db 156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215 

Qy 250 PGYI LLDDNRSCQDI NECEHRNHTCNLQQTCYNLQGGFKCI DPI RCEEPYLRI SDNRCMC 309 

II = I = E I I I I I I I II M IM I :|| IN 

Db 216 QGYQW-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273 

Qy 310 PAENPGCRDQPFTI LYRDMDWSGRS VPADI FQMQATTRYPGAYYI FQI KSGNEGREFYM 369 

I I lh I :hh I : I lllhlllhllll I hllllll llh 

Db 274 PVSNAMCRELPQSI VYKYMSIRSDRSVPSDIFQIQATTI YANTINTFRIKSGNENGEFYL 333 

Qy 370 RQTGP I SATLVMTRP I KGPREI QLDLEM I TVNTVI NFRGSSVI RLR I YVSQYPF 423 

III hll lh : : I I I I II Ml || Mhll I I = I 

Db 334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387 
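
Each result carries three summary lines made of `name value;` pairs, so they are straightforward to pull into a dictionary for tabulation. The sketch below is one possible parser written for the line layout seen in this report. The regular expression and the function name are illustrative assumptions and are not part of GenCore.

```python
import re

# A field name (letters, dots, spaces), whitespace, a number with an
# optional exponent, an optional percent sign, then a semicolon.
FIELD = re.compile(
    r"\s*([A-Za-z][A-Za-z. ]*?)\s+"
    r"([0-9]+(?:\.[0-9]+)?(?:[eE][-+]?[0-9]+)?)%?;"
)


def parse_summary(line):
    """Return {field name: numeric value} for one summary line."""
    return {name.strip(): float(value) for name, value in FIELD.findall(line)}


# Summary lines of RESULT 7 as printed above.
print(parse_summary("Query Match 40.0%; Score 963.5; DB 2; Length 387;"))
print(parse_summary("Best Local Similarity 48.3%; Pred. No. 1.4e-68;"))
print(parse_summary("Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4;"))
```

All values are returned as floats for simplicity; integer fields such as DB or Length can be cast back with `int()` where needed.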



RESULT 8 

US-08-833-963C-9 



; Sequence 9, Application US/08833963C

; Patent No. 5916769 

; GENERAL INFORMATION: 

APPLICANT: Olsen, et al.

TITLE OF INVENTION: Extracellular/Epidermal Growth Factor
TITLE OF INVENTION: HCABA58X
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Ave 

CITY: Rockville 

STATE : MD 

COUNTRY : USA 

ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/833,963C

FILING DATE: 11-APR-1997

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US96/05033 

FILING DATE: 10-APR-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Brookes, A. Anders

REGISTRATION NUMBER: 36,373 

REFERENCE/DOCKET NUMBER: PF258 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 387 amino acids

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-963C-9 

Query Match 40.0%; Score 963.5; DB 2; Length 387; 

Best Local Similarity 48.3%; Pred. No. 1.4e-68; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy 71 APPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CINTEGGYTCSC 130 

M h I I : I Ih I i I hill = | I hill I : I I 
Db 38 ADPQRI PSNP - - SHRI QCAAGYEQSEHNVCQDI DECTAGTHNCRADQVCI NLRGSFACQC 95 

Qy 131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189 



"I M*lMM II* I 1*111 I * * I 1*111

Db 96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155



Qy 




Db 



Qy 250 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 309 

II -II 1 IN || I |||:| |::||: 'l 

Db 216 QGYQW-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273 

Qy 310 PAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 369 

I I I I : I ^hh I : I I : l!M I | III 
Db 274 PVSNAMCRELPQS I VYKYMS I RSDRSVPSDI FQI QATTI YANTINTFRI KSGNENGEFYL 333 

Qy 370 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRI YVSQYPF 423 

III M Ih : : I I I I I M , : II |||:|| | | : | 
Db 334 RQTS P VSAMLVLVKSLSG PREH I VDLEMLTVS S I GTFRTS S VLRLT 1 1 VG PFS F 387 



RESULT 9 
US-08-980-514-3 

; Sequence 3, Application US/08980514

; Patent No. 6004753 

; GENERAL INFORMATION: 

APPLICANT: Yue, Henry 

APPLICANT: Guegler, Karl J. 

APPLICANT: Shah, Purvi 

TITLE OF INVENTION: HUMAN S1-5 ECMP-LIKE PROT
TITLE OF INVENTION: EIN
NUMBER OF SEQUENCES: 3 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Dr. 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/980,514

FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749

REFERENCE/DOCKET NUMBER: PF-0436 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 650-855-0555 

TELEFAX: 650-845-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 387 amino acids 

TYPE: amino acid 

STRANDEDNESS: single

TOPOLOGY: linear 
IMMEDIATE SOURCE: 



LIBRARY: GenBank 
CLONE: 458228 
US-08-980-514-3 



Query Match 40.0%; Score 963.5; DB 3; Length 387;

Best Local Similarity 48.3%; Pred. No. 1.4e-68;

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4;



Qy   71 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 130
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  190 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 249
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  250 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 309
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  310 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 369
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  370 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387



RESULT 10 
US-09-212-168-5 

; Sequence 5, Application US/09212168 
; Patent No. 6303765 
; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Corley, Neil C. 

APPLICANT: Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 
NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : USA 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/212,168 



FILING DATE : 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/884,072 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 

TELEX : 

; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 387 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 458228 
US-09-212-168-5 

Query Match 40.0%; Score 963.5; DB 4; Length 387; 

Best Local Similarity 48.3%; Pred. No. 1.4e-68; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 

Qy   71 APPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSC 130
        | |   |: |  |  : |  ||:  | | | |:|||   :| |   |:|||  | : | |
Db   38 ADPQRIPSNP--SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQC 95

Qy  131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189
          ||     ||:|||||    || | | | |||: | |:||| |  :  :| |:|||
Db   96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155

Qy  190 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 249
        | | | | |  |||||:|: ||||  | ::| |:|||  | :|||::|||:|| : | ||
Db  156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215

Qy  250 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 309
         || ::  :|:||||||||  |  |   : |:|  |||:|     |::||:   :|||:|
Db  216 QGYQVV-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVC 273

Qy  310 PAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYM 369
        |  |  ||: | :|:|: | : | ||||:||||:|||| |      |:||||||  |||:
Db  274 PVSNAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYL 333

Qy  370 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
        ||| |:|| ||: : : ||||  :||||:||:::  || |||:|| | |  : |
Db  334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 387
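
The count line of this hit (Matches 171; Conservative 55; Mismatches 123; Indels 5) can be related to its "Best Local Similarity" figure. The following is a minimal sketch, not part of the GenCore output. It assumes that the percentage is the number of identical columns divided by the total number of aligned columns, with indel columns included; the function name is hypothetical. Under that assumption the counts printed in this report reproduce the reported percentages.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all aligned columns (assumed definition).

    The alignment length is taken to be the sum of the four column
    classes printed in the report's count line.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Counts copied from the count lines in this report.
hits = {
    "US-09-212-168-5": (171, 55, 123, 5),
    "US-09-188-930-336": (154, 47, 73, 1),
    "5177197-30": (126, 51, 126, 122),
}
for name, counts in hits.items():
    print(f"{name}: {best_local_similarity(*counts):.1f}%")
```

Rounded to one decimal place, the three values agree with the 48.3%, 56.0% and 29.6% printed for these hits.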



RESULT 11 
US-09-188-930-336 

; Sequence 336, Application US/09188930A 
; Patent No. 6150502 



GENERAL INFORMATION: 
APPLICANT: Watson, James D. 
APPLICANT: Strachan, Lorna 
APPLICANT: Sleeman, Matthew 
APPLICANT: Onrust, Rene 
APPLICANT: Murison, James Greg 

TITLE OF INVENTION: Compositions Isolated From Skin Cells 
TITLE OF INVENTION: and Methods For Their Use 
FILE REFERENCE: 11000.1011C1

CURRENT APPLICATION NUMBER: US/09/188,930A
CURRENT FILING DATE: 1998-11-09 
NUMBER OF SEQ ID NOS : 348 
SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 336 
LENGTH: 274 
TYPE : PRT 
ORGANISM: Human 
US-09-188-930-336 

Query Match 37.9%; Score 912.5; DB 3; Length 274;

Best Local Similarity 56.0%; Pred. No. 1e-64;

Matches 154; Conservative 47; Mismatches 73; Indels 1; Gaps 1;

Qy  149 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 208
        | |||  | |:|||: | | ||| |  : ||| |||||    || | | |:||:|:|||
Db    1 YRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCH 60

Qy  209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268
         ||||  ||  |||:||||:| :|||: |||:|| : | || || ||   | ||||:|||
Db   61 QGYELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECE 119

Qy  269 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 328
           | |:  ||| |  ||::|:|  || |||:::|:|||:||| || ||:|| :|::| |
Db  120 SGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYM 179

Qy  329 DVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 388
         : | ||||||:||:|||: |||||  |||::||   :||:||   :|| ||: ||: ||
Db  180 TITSERSVPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGP 239

Qy  389 REIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
        ||  |||||:|:|:::::| |||:|| ::|  | |
Db  240 REYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 274



RESULT 12 

US-09-312-283C-336 

Sequence 336, Application US/09312283C 
Patent No. 6573095 
GENERAL INFORMATION: 
APPLICANT: Watson, James D. 
APPLICANT: Strachan, Lorna 
APPLICANT: Sleeman, Matthew 
APPLICANT: Onrust, Rene 
APPLICANT: Murison, James G. 
APPLICANT: Kumble, Krishanand D. 

TITLE OF INVENTION: Compositions Isolated from Skin Cells 
TITLE OF INVENTION: and Methods for Their Use 



FILE REFERENCE: 11000.1011C2

CURRENT APPLICATION NUMBER: US/09/312,283C
CURRENT FILING DATE: 1999-05-14
NUMBER OF SEQ ID NOS : 425 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 336 
LENGTH: 274 
TYPE : PRT 
ORGANISM: Mouse 
US-09-312-283C-336 

Query Match 37.9%; Score 912.5; DB 4; Length 274; 

Best Local Similarity 56.0%; Pred. No. 1e-64;

Matches 154; Conservative 47; Mismatches 73; Indels 1; Gaps 1; 
Qy  149 YGYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCD 208
        | |||  | |:|||: | | ||| |  : ||| |||||    || | | |:||:|:|||
Db    1 YRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCH 60

Qy  209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268
         ||||  ||  |||:||||:| :|||: |||:|| : | || || ||   | ||||:|||
Db   61 QGYELHRDGFSCSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLL-ATRLCQDIDECE 119

Qy  269 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDM 328
           | |:  ||| |  ||::|:|  || |||:::|:|||:||| || ||:|| :|::| |
Db  120 SGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYIQVSENRCLCPASNPLCREQPSSIVHRYM 179

Qy  329 DVVSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGP 388
         : | ||||||:||:|||: |||||  |||::||   :||:||   :|| ||: ||: ||
Db  180 TITSERSVPADVFQIQATSVYPGAYNAFQIRAGNSQGDFYIRQINNVSAMLVLARPVTGP 239

Qy  389 REIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423
        ||  |||||:|:|:::::| |||:|| ::|  | |
Db  240 REYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 274



RESULT 13 
5177197-30 
Patent No. 5177197 

APPLICANT: KANZAKI, TETSUTO; OLOFSSON, ANDERS; MOREN, ANITA;
WERNSTEDT, CHRISTER; HELLMAN, ULF; MIYAZONO, KOHEI; CLAESSON-WELSH,
LENA; HELDIN, CARL-HENRIK

TITLE OF INVENTION: ISOLATED NUCLEOTIDE SEQUENCE EXPRESSING
HUMAN TRANSFORMING GROWTH FACTOR-BETA1-BINDING PROTEIN
NUMBER OF SEQUENCES: 53 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/487,343
FILING DATE: 27-FEB-1990 
SEQ ID NO: 30: 

LENGTH: 1394 
5177197-30 

Query Match 22.4%; Score 539; DB 6; Length 1394; 

Best Local Similarity 29.6%; Pred. No. 1.9e-34; 

Matches 126; Conservative 51; Mismatches 126; Indels 122; Gaps 17; 



Qy 2 CTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYST 61



I h I hill : I || | :||| 

Db 573 CYEGYRFSEQQRKCVDIDECTQVQHLC-SQGRCENTEGSFLCI 614 

Qy 62 PYSGPYPAAAPPLSAPNYPTISRPLI CRFGYQMDESNQCVDVD 104 

: |: : : || : | ||:| : :| |:| 

Db 615 - CPAGFMASEEGTNCI DVDECLRPDVCGEGHCVNTVGAFRCEYCDSGYRMTQRGRCEDI D 673 

Qy 105 ECATDSHQCNPTQ I C I NTEGGYTC - SCTDG YWLLEGQCLDI DEC - RYGYCQQ - LCANVPG 161 

II III: hh I I I I I : I : I'M" I hh | 

Db 674 ECLNPS-TC-PDEQCVNSPGSYQCVPCTEGFRGWNGQCLDVDECLEPNVCANGDCSNLEG 731

Qy 162 SYSCTCNPGFTLNEDGRSCQDVNECATENPCVQ 194 

II hh hi I : : h I II 

Db 732 SYMCSCHKGYTRTPDHKHCRDIDECQQGNLCVNGQCKNTEGSFRCTCGQGYQLSAAKDQC 791 

Qy 195 TCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQH-ECV 238 

I II III I II II I II h:|| : hi :h 

Db 792 EDIDECQHRHLCAHGQCRNTEGSFQCVCDQGYRASGLGDHCEDINECLEDKSVCQRGDCI 851 

Qy 239 NQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

I hi hi I h IT: I II I | I | | :| | |: 

Db 852 NTAGS YDCTCPDGF - QLDDNKTCQDINECEHPG - LCGPQGECLNTEGSFHCVCQQGFS I S 909 

Qy 291 -DPIRCEEPYLRIS DN RCMC PAENPGCRDQPFTI LYRDM 328 

I lh II I 
Db 910 ADGRTCEDIDECVNNTVCDSHGFCDNTAGSFRCLCYQGFQAPQDGQGCVD VNEC 963 

Qy 329 DWSG 333 

:::|| 

Db 964 ELLSG 968 



RESULT 14 
US-09-482-273-159 

Sequence 159, Application US/09482273 
Patent No. 6534631 
GENERAL INFORMATION: 
APPLICANT: Rosen et al. 

TITLE OF INVENTION: 71 Human Secreted Proteins 
FILE REFERENCE: PZ030P1 

CURRENT APPLICATION NUMBER: US/09/482,273
CURRENT FILING DATE: 2000-01-13
EARLIER APPLICATION NUMBER: PCT/US99/15849
EARLIER FILING DATE: 1999-07-14 
EARLIER APPLICATION NUMBER: 60/092,921 
EARLIER FILING DATE: 1998-07-15 
EARLIER APPLICATION NUMBER: 60/092,922 
EARLIER FILING DATE: 1998-07-15 
EARLIER APPLICATION NUMBER: 60/092,956 
EARLIER FILING DATE: 1998-07-15 
NUMBER OF SEQ ID NOS : 267 
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 159 
LENGTH: 575 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-482-273-159 



Query Match 21.5%; Score 518; DB 4; Length 575; 

Best Local Similarity 29.5%; Pred. No. 3.2e-33; 

Matches 140; Conservative 56; Mismatches 157; Indels 122; Gaps 20; 
Qy 2 CTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYST 61 

I I I I hhlll || Ml I I hi I 

Db 170 CPEGFELDSQGAFCVDVDECAWDAHLCREGQRCVNLLGSYRCLPDCGP 217 

Qy 62 PYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CIN 121 

\- : I Mill h hi | 

Db 218 GFRVADGAGCEDVDECLEGLDDCHYNQLCEN 248 

Qy 122 TEGG YTCS CTDG YWLLEG QCLDI DECRY - -GYCQQLCANVPGSYSCTCNPGFTLNED 176 

I Ih Ml II :M MhMI I I h III I I II II I 

Db 249 TPGGHRCSCPRGY-RMQGPSLPCLDVNECLQLPKACAYQCHNLQGSYRCLCPPGQTLLRD 307

Qy 177 GRSC QD VN E CAT EN P C V QTCVNTYGSFI CRCD 208 

Db 308 GKACTSLERNGQNVTTVSHRGPLLPWLRPWAS I PGTSYHAWVSLRPGPMALS SVG RAW CP 367 

Qy 209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268 

Ih : I I : I : I I I MM II hi I II II II 

Db 368 PGF - I RQNGV- CTDLDECRVRN - LCQHACRNTEGS YQCLCPAG YRLLPSGKNCQD I NECE 424 

Qy 269 HRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPA ENPG CRD 318 

: I I Ml =| -hi Ml Ml I 

Db 425 EES I ECGPGQMCFNTRGS YQCVD TPCPATYRQGPS PGTCFRRCSQ 469 

Qy 319 Q PFTILYRDMDWSGRSVPADI FQMQATTR- - YPGAYYI FQIKSGNEGREFYMRQ 371 

| |= || : : | | : : : | : | : : | =| 

Db 470 DCGTGGPSTLQYRLLPLPLGVRAHHDVARLTAFSEVGVPANRTELSMLEPDPRSPFALR- 528 

Qy 372 TGP I SATL - - VMTRP I KG PRE I - QLDLEM I TVNTVI NFRGSS VI RLR I YVSQYPF 423 

h II III I : : I Ml I II II II Ih 

Db 529 - - PLRAGLGAVYTR RALTRAGLYRLTVRAAAP-RHQSVFVLLIAVSPYPY 575 



RESULT 15 
US-09-482-273-245 

; Sequence 245, Application US/09482273 

; Patent No. 6534631 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

; TITLE OF INVENTION: 71 Human Secreted Proteins
; FILE REFERENCE: PZ030P1

; CURRENT APPLICATION NUMBER: US/09/482,273

; CURRENT FILING DATE: 2000-01-13

; EARLIER APPLICATION NUMBER: PCT/US99/15849

; EARLIER FILING DATE: 1999-07-14 

; EARLIER APPLICATION NUMBER: 60/092,921 

; EARLIER FILING DATE: 1998-07-15 

; EARLIER APPLICATION NUMBER: 60/092,922 

; EARLIER FILING DATE: 1998-07-15 

; EARLIER APPLICATION NUMBER: 60/092,956 

; EARLIER FILING DATE: 1998-07-15 

; NUMBER OF SEQ ID NOS : 267 



SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 245 
LENGTH: 638 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-482-273-245 

Query Match 21.5%; Score 518; DB 4; Length 638; 

Best Local Similarity 29.5%; Pred. No. 3.6e-33; 

Matches 140; Conservative 56; Mismatches 157; Indels 122; Gaps 20; 

Qy 2 CTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYST 61 

I I I hhlll || Ml | | hi I 
Db 233 CPEGFELDSQGAFCVDVDECAWDAHLCREGQRCVNLLGSYRCLPDCGP 280 

Qy 62 PYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CIN 121 

1 = = : = I Mill h hi I 

Db 281 GFRVADGAGCEDVDECLEGLDDCHYNQLCEN 311 

Qy 122 TEGGYTCSCTDGYWLLEG QCLDI DECRY- -GYCQQLCANVPGSYSCTCNPGFTLNED 176 

I Ih III II -I I | I h III I I II II I 

Db 312 TPGGHRCSCPRGY-RMQGPSLPCLDVNECLQLPKACAYQCHNLQGSYRCLCPPGQTLLRD 370 

Qy 177 GRSC QDVNECATENPCV QTCVNTYGSFI CRCD 208 

l-l hi : I : =| : = = | 

Db 371 GKACTSLERNGQNVTTVSHRGPLLPWLRPWASI PGTSYHAWVSLRPGPMALSSVGRAWCP 430 

Qy 209 PGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILLDDNRSCQDINECE 268 

Ih = ::|| hhlll III I hll I I I -IIIIIMI 

Db 431 PGF - I RQNGV- CTDLDECRVRN - LCQHACRNTEGS YQCLCPAGYRLLPSGKNCQDI NECE 487 

Qy 269 HRNHTCNLQQTC YNLQGGFKC I DP I RCEEP YLR I SDNRCMCPA ENPG CRD 318 

= I I h T ^ I : : h I ||| :|| | 

Db 488 EES I ECGPGQMCFNTRGS YQCVD TPCPATYRQGPSPGTCFRRCSQ 532 

Qy 319 Q PFTILYRDMDWSGRSVPADIFQMQATTR- -YPGAYYIFQIKSGNEGREFYMRQ 371 

I h II : : I | : : : | : | : : | :| 

Db 533 DCGTGGPSTLQYRLLPLPLGVRAHHDVARLTAFSEVGVPANRTELSMLEPDPRSPFALR- 591 

Qy 372 TGP I SATL - - VMTRP I KGPRE I - QLDLEM I TVNTVI NFRGSS VI RLR I YVSQ YPF 423 

I : I I III I : = I HI I II I I II Ih 

Db 592 - - PLRAGLGAVYTR RALTRAGLYRLTVRAAAP-RHQSVFVLLIAVSPYPY 638 



Search completed: January 9, 2004, 12:35:36 
Job time : 16.0551 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:          January 9, 2004, 12:34:12 ; Search time 28.1676 Seconds
                 (without alignments)
                 3026.121 Million cell updates/sec

Title:           US-09-674-379A-14
Perfect score:   2407
Sequence:        1 QCTNGFDLDRQSGQCLDIDE...INFRGSSVIRLRIYVSQYPF 423

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        747907 seqs, 201509753 residues

Total number of hits satisfying chosen parameters:   747907

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :       Published_Applications_AA:*

1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
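
The run header's figures can be cross-checked against one another. The sketch below is an illustration only and is not produced by GenCore. It rests on two assumptions that the report does not state: that "Million cell updates/sec" is the query length times the number of database residues, divided by the search time, and that "Query Match" is a hit's score expressed as a percentage of the perfect score. Both function names are hypothetical.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Assumed definition: one dynamic-programming cell per
    (query residue, database residue) pair, divided by the search time."""
    return query_len * db_residues / seconds / 1e6


def query_match_percent(score, perfect_score):
    """Assumed definition: hit score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


# Values copied from this report's header and result lines.
rate = million_cell_updates_per_sec(423, 201509753, 28.1676)
print(f"{rate:.1f} million cell updates/sec")
print(f"{query_match_percent(2401, 2407):.1f}% query match")
```

With the header values of this run (423 query residues, 201509753 database residues, 28.1676 seconds) the first function agrees with the reported 3026.121 to within rounding, and a score of 2401 against a perfect score of 2407 gives the 99.8% printed for those hits.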
ALIGNMENTS



RESULT 1 
US-09-083-002-2 

; Sequence 2, Application US/09083002 
; Patent No. US20010016650A1 
; GENERAL INFORMATION: 

APPLICANT: Jacobs, Kenneth
APPLICANT: McCoy, John M.
APPLICANT: Racie, Lisa A.
APPLICANT: LaVallie, Edward R.
APPLICANT: Merberg, David
APPLICANT: Treacy, Maurice
APPLICANT: Evans, Cheryl
APPLICANT: Agostino, Michael
APPLICANT: Lu, Zhijian
APPLICANT: Honjo, Tasuku
APPLICANT: Tashiro, Kei
APPLICANT: Nakamura, Tomoyuki
TITLE OF INVENTION: SECRETED PROTEINS
NUMBER OF SEQUENCES: 2
CORRESPONDENCE ADDRESS:

ADDRESSEE: Genetics Institute, Inc.
STREET: 87 CambridgePark Drive
CITY: Cambridge
STATE: MA
COUNTRY: U.S.A.
ZIP: 02140
COMPUTER READABLE FORM:

MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/083,002
FILING DATE:
CLASSIFICATION:
ATTORNEY/AGENT INFORMATION:
NAME: Sprunger, Suzanne A.
REGISTRATION NUMBER: P-41,323
TELECOMMUNICATION INFORMATION:
TELEPHONE: (617) 498-8284
TELEFAX: (617) 876-5851
INFORMATION FOR SEQ ID NO: 2:
SEQUENCE CHARACTERISTICS:
LENGTH: 448 amino acids
TYPE: amino acid
STRANDEDNESS: not relevant
TOPOLOGY: linear
MOLECULE TYPE: protein
US-09-083-002-2



Query Match 100.0%; Score 2407; DB 9; Length 448;

Best Local Similarity 100.0%; Pred. No. 3.3e-183;

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 2 
US-09-275-805-2 

; Sequence 2, Application US/09275805 
; Patent No. US20010051358A1 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, HENRIK S.

APPLICANT: LI, HAODONG 

TITLE OF INVENTION: EXTRACELLULAR EPIDERMAL GROWTH FACTOR 
TITLE OF INVENTION: LIKE PROTEIN 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

STREET: 9410 KEY WEST AVENUE 

CITY: ROCKVILLE 

STATE : MD 

COUNTRY : US 

ZIP: 20850 
COMPUTER READABLE FORM : 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/275,805

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/839,525 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: BROOKES, ANDERS A. 

REGISTRATION NUMBER: 36,373 

REFERENCE/DOCKET NUMBER: PF224 



TELECOMMUNICATION INFORMATION: 
TELEPHONE: (301) 309-8509 
TELEFAX: (301) 309-8512 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS:
LENGTH: 448 amino acids
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-275-805-2 



Query Match 100.0%; Score 2407; DB 9; Length 448; 

Best Local Similarity 100.0%; Pred. No. 3.3e-183; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448





RESULT 3 
US-09-836-561-1 

; Sequence 1, Application US/09836561 
; Patent No. US20020038006A1 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

Corley, Neil C. 
; Guegler, Karl J. 

TITLE OF INVENTION: HUMAN EXTRACELLULAR MATRIX PROTEINS 



NUMBER OF SEQUENCES: 6 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: USA 
ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/836,561
FILING DATE: 16-Apr-2001
CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA:

APPLICATION NUMBER: 09/212,168
FILING DATE: <Unknown>
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0333 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
TELEX: <Unknown> 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 448 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: CORNNOT01 
CLONE: 45517 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-836-561-1 

Query Match 100.0%; Score 2407; DB 9; Length 448; 

Best Local Similarity 100.0%; Pred. No. 3.3e-183; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 4 
US-10-041-016-2 

; Sequence 2, Application US/10041016 
; Publication No. US20020165151A1 
GENERAL INFORMATION: 

APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
; Racie, Lisa A. 

; LaVallie, Edward R. 

; Merberg, David 

; Treacy, Maurice 

Evans, Cheryl
Agostino, Michael 
; Lu, Zhijian 

; Honjo, Tasuku 

TITLE OF INVENTION: SECRETED PROTEINS 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genetics Institute, Inc. 
STREET: 87 CambridgePark Drive 
CITY: Cambridge 
STATE: MA 
COUNTRY: U.S.A. 
ZIP: 02140 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/10/041,016
FILING DATE: 07-Jan-2002
CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US/09/083,002
FILING DATE: 21-MAR-1998
ATTORNEY/AGENT INFORMATION:

NAME: Sprunger, Suzanne A.
REGISTRATION NUMBER: P-41,323
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 498-8284 
TELEFAX: (617) 876-5851 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 448 amino acids 
TYPE: amino acid 

STRANDEDNESS : No. US20020165151A1 Relevant 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-041-016-2 

Query Match 100.0%; Score 2407; DB 14; Length 448; 

Best Local Similarity 100.0%; Pred. No. 3.3e-183; 

Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 5

US-10-199-672-408

Sequence 408, Application US/10199672
Publication No. US20030148442A1
GENERAL INFORMATION:



APPLICANT: Baker, Kevin P. 
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C1

CURRENT APPLICATION NUMBER: US/10/199,672
CURRENT FILING DATE: 2002-07-18
PRIOR APPLICATION NUMBER: US/10/052,586
PRIOR FILING DATE: 2002-01-15 
PRIOR APPLICATION NUMBER: 60/059263 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/059266 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/063120 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063121 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063486 
PRIOR FILING DATE: 1997-10-21 
PRIOR APPLICATION NUMBER: 60/063540 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063541 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063544 
PRIOR FILING DATE: 1997-10-28 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 612 
SEQ ID NO 408 
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien
US-10-199-672-408 



Query Match 99.8%; Score 2401; DB 12; Length 448;

Best Local Similarity 99.8%; Pred. No. 9.9e-183;

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 6 

US-10-187-749-408 

Sequence 408, Application US/10187749 
Publication No. US20030153036A1 
GENERAL INFORMATION: 



APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin



TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C1 

CURRENT APPLICATION NUMBER: US/10/187,749
CURRENT FILING DATE: 2002-07-01
PRIOR APPLICATION NUMBER: US/10/052,586
PRIOR FILING DATE: 2002-01-15
PRIOR APPLICATION NUMBER: 60/059263
PRIOR FILING DATE: 1997-09-18
PRIOR APPLICATION NUMBER: 60/059266
PRIOR FILING DATE: 1997-09-18
PRIOR APPLICATION NUMBER: 60/062250
PRIOR FILING DATE: 1997-10-17
PRIOR APPLICATION NUMBER: 60/063120
PRIOR FILING DATE: 1997-10-24
PRIOR APPLICATION NUMBER: 60/063121
PRIOR FILING DATE: 1997-10-24
PRIOR APPLICATION NUMBER: 60/063486
PRIOR FILING DATE: 1997-10-21
PRIOR APPLICATION NUMBER: 60/063540
PRIOR FILING DATE: 1997-10-28
PRIOR APPLICATION NUMBER: 60/063541
PRIOR FILING DATE: 1997-10-28
PRIOR APPLICATION NUMBER: 60/063544
PRIOR FILING DATE: 1997-10-28

Remaining Prior Application data removed - See File Wrapper or PALM.
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-187-749-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 7 

US-10-194-457-408 

; Sequence 408, Application US/10194457 
; Publication No. US20030153037A1 



GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C296
CURRENT APPLICATION NUMBER: US/10/194,457
CURRENT FILING DATE: 2002-07-11 
PRIOR APPLICATION NUMBER: 10/052586 
PRIOR FILING DATE: 2002-01-15 
PRIOR APPLICATION NUMBER: 60/059263 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/059266 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/063120 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063121 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063486 
PRIOR FILING DATE: 1997-10-21 
PRIOR APPLICATION NUMBER: 60/063540 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063541 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063544 
PRIOR FILING DATE: 1997-10-28 

Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-194-457-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448





RESULT 8 

US-10-184-642-408 

Sequence 408, Application US/10184642 
Publication No. US20030157635A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C194 
CURRENT APPLICATION NUMBER: US/10/184,642
CURRENT FILING DATE: 2002-06-27 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612 
SEQ ID NO 408 
LENGTH: 448 
TYPE: PRT 

ORGANISM: Homo Sapien 
US-10-184-642-408 



Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448





RESULT 9 

US-10-196-747-408 

Sequence 408, Application US/10196747 
Publication No. US20030162250A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C346 

CURRENT APPLICATION NUMBER: US/10/196,747
CURRENT FILING DATE: 2002-07-16

Prior Application removed - See File Wrapper or Palm
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-196-747-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 10 
US-10-173-689-408 

Sequence 408, Application US/10173689 
Publication No. US20030166104A1 
GENERAL INFORMATION: 



; APPLICANT: Baker, Kevin P.
; APPLICANT: Chen, Jian
; APPLICANT: Desnoyers, Luc
; APPLICANT: Goddard, Audrey
; APPLICANT: Godowski, Paul J.
; APPLICANT: Gurney, Austin L.
; APPLICANT: Pan, James
; APPLICANT: Smith, Victoria
; APPLICANT: Watanabe, Colin K.
; APPLICANT: Wood, William I.
; APPLICANT: Zhang, Zemin

; TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
; TITLE OF INVENTION: ACIDS ENCODING THE SAME 
; FILE REFERENCE: P3430R1C10 

; CURRENT APPLICATION NUMBER: US/10/173,689
; CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm
; NUMBER OF SEQ ID NOS: 612
; SEQ ID NO 408
; LENGTH: 448
; TYPE: PRT
; ORGANISM: Homo Sapien
US-10-173-689-408 

Query Match 99.8%; Score 2401; DB 12; Length 448; 

Best Local Similarity 99.8%; Pred. No. 9.9e-183; 

Matches 422; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448
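
The single mismatch reported for this result ("Mismatches 1; Indels 0; Gaps 0") can be located by comparing the Qy 181 and Db 206 rows column by column. The following is a minimal sketch, assuming an ungapped alignment (which the zero indel and gap counts support); the function name `find_mismatches` is hypothetical and is not part of the search software.

```python
def find_mismatches(qy_seq, db_seq, qy_start, db_start):
    """Return (query_pos, query_residue, db_pos, db_residue) for every
    column where the two aligned rows differ.

    Assumes the rows are ungapped and of equal length, so position i of
    the query row pairs with position i of the database row.
    """
    if len(qy_seq) != len(db_seq):
        raise ValueError("aligned rows must have equal length")
    return [
        (qy_start + i, q, db_start + i, d)
        for i, (q, d) in enumerate(zip(qy_seq, db_seq))
        if q != d
    ]


# Rows copied from the alignment above (Qy 181-240, Db 206-265).
QY_181 = "QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ"
DB_206 = "QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ"

mismatches = find_mismatches(QY_181, DB_206, qy_start=181, db_start=206)
print(mismatches)
```

Under these assumptions the only differing column is query position 203 (F) against database position 228 (L), which accounts for 422 matches out of 423 aligned residues.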



RESULT 11 
US-10-173-690-408 

; Sequence 408, Application US/10173690 
; Publication No. US20030166105A1 
; GENERAL INFORMATION: 



APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C9 

CURRENT APPLICATION NUMBER: US/10/173,690 
CURRENT FILING DATE: 2002-06-17 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-173-690-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 12 
US-10-173-691-408 

Sequence 408, Application US/10173691 
Publication No. US20030166106A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C13 

CURRENT APPLICATION NUMBER: US/10/173,691
CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-173-691-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 13 
US-10-173-692-408 

Sequence 408, Application US/10173692 
Publication No. US20030166188A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC
TITLE OF INVENTION: ACIDS ENCODING THE SAME
FILE REFERENCE: P3430R1C20

CURRENT APPLICATION NUMBER: US/10/173,692
CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-173-692-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 14 
US-10-173-694-408 

Sequence 408, Application US/10173694 
Publication No. US20030166107A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C19 

CURRENT APPLICATION NUMBER: US/10/173,694
CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-173-694-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;



Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



RESULT 15 
US-10-173-698-408 

Sequence 408, Application US/10173698 
Publication No. US20030166108A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P.
APPLICANT: Chen, Jian
APPLICANT: Desnoyers, Luc
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Gurney, Austin L.
APPLICANT: Pan, James
APPLICANT: Smith, Victoria
APPLICANT: Watanabe, Colin K.
APPLICANT: Wood, William I.
APPLICANT: Zhang, Zemin

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C12

CURRENT APPLICATION NUMBER: US/10/173,698
CURRENT FILING DATE: 2002-06-17

Prior Application removed - See File Wrapper or Palm
NUMBER OF SEQ ID NOS: 612
SEQ ID NO 408
LENGTH: 448
TYPE: PRT
ORGANISM: Homo Sapien
US-10-173-698-408

Query Match 99.8%;  Score 2401;  DB 12;  Length 448;
Best Local Similarity 99.8%;  Pred. No. 9.9e-183;
Matches 422;  Conservative 0;  Mismatches 1;  Indels 0;  Gaps 0;

Qy    1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy   61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy  121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy  181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
        |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db  206 QDVNECATENPCVQTCVNTYGSLICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy  241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy  301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy  361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy  421 YPF 423
        |||
Db  446 YPF 448



Search completed: January 9, 2004, 12:43:59 
Job time : 29.1676 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         January 9, 2004, 12:29:16 ; Search time 16.0264 Seconds
                (without alignments)
                2538.270 Million cell updates/sec

Title:          US-09-674-379A-14
Perfect score:  2407
Sequence:       1 QCTNGFDLDRQSGQCLDIDE...INFRGSSVIRLRIYVSQYPF 423

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       283308 seqs, 96168682 residues

Total number of hits satisfying chosen parameters: 283308

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database : PIR_76:* 
1: pir1:*
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
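
Pred. No. depends on the full score distribution and cannot be recomputed from this listing alone, but the "Query Match" percentage in the summaries can be sanity-checked. The sketch below assumes that Query Match is the hit score divided by the perfect score (2407 for this query), rounded to one decimal place; that definition is inferred from the reported figures rather than stated by the report, and the function name is hypothetical.

```python
PERFECT_SCORE = 2407  # "Perfect score" from the report header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the perfect score (assumed definition),
    rounded to one decimal place as in the summary listing."""
    return round(100.0 * score / perfect_score, 1)


# Scores of the first three hits in the summary listing.
for score in (1108.5, 963.5, 744.5):
    print(score, query_match_percent(score))
```

Under this assumption the first three summary scores give 46.1, 40.0 and 30.9, which agrees with the Query Match column.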



SUMMARIES

                   %
Result           Query
   No.   Score   Match  Length  DB  ID           Description

     1  1108.5    46.1     493   2  JC5621       epidermal growth f
     2   963.5    40.0     387   2  138449       extracellular prot
     3   744.5    30.9     685   2  S78040       fibulin, splice fo
     4   735.5    30.6     683   2  C36346       fibulin 1 precurso
     5   729.5    30.3    1221   2  A49457       fibulin-2 precurso
     6   702      29.2     705   2  S34968       fibulin, splice fo
     7   701.5    29.1    1184   2  A55184       fibulin-2 precurso
     8   617.5    25.7     601   2  B36346       fibulin 1 precurso
     9   603      25.1     689   2  T42760       fibulin, splice fo
    10   596      24.8     712   2  T42990       fibulin 1, splice
    11   592.5    24.6     589   2  T43210       fibulin-1D precurs
    12   574      23.8     798   2  T22793       hypothetical prote
    13   539      22.4    1394   2  A35626       transforming growt
    14   523.5    21.7    3002   2  A47221       [illegible]
    15   522.5    21.7    2871   2  A55567       fibrillin 1 - bovi
    16   518      21.5    1712   2  A38261       [illegible]
    17   509.5    21.2    2871   2  [illegible]  [illegible]
    18   509.5    21.2    2907   2  A57278       [illegible]
    19   508.5    21.1    2918   2  A54105       [illegible]
    20   469.5    19.5    1820   2  A55494       [illegible]
    21   455      18.9     741   2  T46488       [illegible]
    22   448.5    [illegible]  [illegible]  [illegible]  [illegible]  hypothetical prote
    23   438      18.2    1251   2  A57293       [illegible]
    24   436      18.1    1574   2  T13954       [missing]
    25   392      16.3     886   2  A57172       [illegible]
    26   384.5    16.0    3507   2  T34513       [illegible]
    27   368.5    15.3    1106   2  [illegible]  [illegible]
    28   367      15.2    2471   2  A49128       [illegible]
    29   364      15.1     810   2  T10756       [illegible]
    30   361      15.0    1081   2  T31329       [illegible]
    31   359      14.9    1203   2  A49175       [illegible]
    32   356      14.8    2555   2  A40043       [illegible]
    33   353.5    14.7    1964   2  T09059       [illegible]
    34   351.5    14.6    2437   2  S42612       [illegible]
    35   348.5    14.5    2703   1  A24420       [illegible]
    36   340.5    14.1     673   2  A48089       [illegible]
    37   340      14.1    2524   2  A35844       [illegible]
    38   335.5    13.9    2531   2  [missing]    [illegible]
    39   335      13.9     674   2  155476       [illegible]
    40   334      13.9     511   2  T17298       hypothetical prote
    41   334      13.9    2531   2  T31070       notch homolog - se
    42   333      13.8    1064   2  A40136       fibropellin Ia - s
    43   333      13.8    2321   2  S78549       notch3 protein - h
    44   325.5    13.5    2531   2  A46019       notch-1 protein -
    45   324.5    13.5    2318   2  S45306       notch 3 protein -
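The "% Query Match" column appears to be the hit score expressed as a percentage of the perfect score reported in the header (2407 for this query). That relationship is not stated in the report; it is an assumption that happens to reproduce the listed values. A minimal Python sketch, with the hypothetical helper name `query_match_percent`:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return a hit score as a percentage of the query's perfect score.

    Assumption: the report's "% Query Match" is score / perfect score.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return 100.0 * score / perfect_score


PERFECT_SCORE = 2407  # "Perfect score" from the report header

# Scores of the first three hits in the summary table.
for score in (1108.5, 963.5, 744.5):
    print(f"{score:7.1f} -> {query_match_percent(score, PERFECT_SCORE):.1f}%")
```

The same check can be applied to any row of the table whose percentage is hard to read.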



ALIGNMENTS 



RESULT 1 
JC5621 

epidermal growth factor-like protein, T16 precursor - rat 
C;Species: Rattus norvegicus (Norway rat)
C;Date: 09-Oct-1997 #sequence_revision 07-Nov-1997 #text_change 05-Nov-1999
C;Accession: JC5621
R;Ozaki, T.; Kondo, K.; Nakamura, Y.; Ichimiya, S.; Nakagawara, A.; Sakiyama, S.
Biochem. Biophys. Res. Commun. 237, 245-250, 1997
A;Title: Interaction of DA41, a DAN-binding protein, with the epidermal growth
factor-like protein, S(1-5).
A;Reference number: JC5621; MUID:97415782; PMID:9268694
A;Accession: JC5621
A;Molecule type: mRNA
A;Residues: 1-493 <OZA>
A;Cross-references: DDBJ:D89730; NID:g2429082; PIDN:BAA22265.1; PID:d1023127;
PID:g2429083
C;Comment: This protein plays a role in the regulation of cell growth by
interacting with DAN protein through DA41 protein.
C;Keywords: glycoprotein
F;1-17/Domain: signal sequence #status predicted <SIG>
F;28-70,158-199,200-237,238-277,278-318,319-359/Region: epidermal growth factor-like repeat
F;249/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 46.1%; Score 1108.5; DB 2; Length 493;

Best Local Similarity 43.6%; Pred. No. 6.7e-66; 

Matches 204; Conservative 68; Mismatches 149; Indels 47; Gaps 5; 
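The "Best Local Similarity" figure is consistent with the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). The report does not define it this way, so the formula below is an inference from the printed numbers, and `best_local_similarity` is a hypothetical helper name:

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percentage of alignment columns that are identical matches.

    Assumption: alignment length = matches + conservative + mismatches + indels.
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / columns


# Counts printed for RESULT 1 (JC5621).
print(f"{best_local_similarity(204, 68, 149, 47):.1f}%")
```

For RESULT 1 the four counts sum to 468 columns, of which 204 are identical.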

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

MIM- I II Mill :hlhl I Ml I M h : 

Db 28 QCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGYLCLPKTAQIIVNNEQPQQE 87 

Qy 61 TPYS GPYPAAAPPLSAPNYPT 81 

II : I = hi - I I 

Db 88 TPAAEASSGAATGTIAARSMATSGVI PGGGFIASATAVAGPEVQTGRNNFVIRRNPADPQ 147 

Qy 82 ISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWL 136 

I hill : M I hill I HI I || 

Db 14 8 RI PSNPSHRI QCAAGYEQSEHNVCQDI DECTSGTHNCRLDQ VCI NLRGSFTCHCLPGYQK 207 

Qy 137 LEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQT 195 

I II I I I Mh I Mill I : M hill I I I 

Db 208 RGEQCVDIDECSVPPYCHQGCVNTPGSFYCQCNPGFQLAANNYTCVDINECDASNQCAQQ 267 

Qy 196 CVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILL 255 

II 1 1 II h h I II I I : M hill I Ml MM I hi I : I II II - 

Db 268 CYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYQW 327. 

Qy 256 DDNRSCQDI NECEHRNHTCNLQQTCYNLQGGFKCI DPI RCEEPYLRI SDNRCMCPAENPG 315 

MMIMIMI II : hi Mhl hMh IMIIMI I 

Db 328 -RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPQNPCQDPYVLTSENRCVCPVSNTM 385 

Qy 316 CRDQPFTILYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI 375 

III I MM: |:: | ||||:|MIMIII I IMIIMI llhlll h 

Db 386 CRDVPQSIVYKYMNIRSDRSVPSDIFQIQATTI YANTINTFRIKSGNENGEFYLRQTSPV 445 

Qy 376 SATLVMTRP I KG PRE I QLDLEM I TVNTVI NFRGSS VI RLR I YVSQYPF 423 

II Ih : = MM : L II MIMI I I : I 

Db 446 SAMLVLVKSLTGPREHIVGLEMLTVSSIGTFRTSSVLRLTI IVGPFSF 493 

RESULT 2 
138449 

extracellular protein - human 
C;Species: Homo sapiens (man)
C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000
C;Accession: 138449
R;Lecka-Czernik, B.; Lumpkin, C.K.
Mol. Cell. Biol. 15, 120-128, 1995
A;Title: An overexpressed gene transcript in senescent and quiescent human
fibroblasts encoding a novel protein in the epidermal growth factor-like repeat
family stimulates DNA synthesis.
A;Reference number: 138449; MUID:95097983; PMID:7799918
A;Accession: 138449
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-387 <RES>
A;Cross-references: EMBL:U03877; NID:g458227; PIDN:AAA65590.1; PID:g458228
C;Genetics:
A;Gene: S1-5

Query Match 40.0%; Score 963.5; DB 2; Length 387;

Best Local Similarity 48.3%; Pred. No. 1.9e-56; 

Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4; 
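Each alignment is preceded by three statistics lines made of `name value;` pairs. The sketch below shows one way such lines could be read into a dictionary once OCR spacing errors inside the numbers have been cleaned up. The function name `parse_stats` and the regular expression are illustrative assumptions, not part of the search program:

```python
import re

# A field is a label (letters, dots, spaces), whitespace, a number with an
# optional exponent and optional percent sign, then a terminating semicolon.
STAT_PATTERN = re.compile(
    r"([A-Za-z][A-Za-z. ]*?)\s+([-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)%?\s*;"
)


def parse_stats(block: str) -> dict:
    """Map each statistic label in the block to its numeric value."""
    return {name.strip(): float(value)
            for name, value in STAT_PATTERN.findall(block)}


example = """\
Query Match 40.0%; Score 963.5; DB 2; Length 387;
Best Local Similarity 48.3%; Pred. No. 1.9e-56;
Matches 171; Conservative 55; Mismatches 123; Indels 5; Gaps 4;
"""

stats = parse_stats(example)
print(stats["Score"], stats["Pred. No."], stats["Gaps"])
```

Values are returned as floats for simplicity; counts such as `Matches` can be cast to `int` by the caller.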



Qy 71 APPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CINTEGGYTCSC 13 0 

M l : l I : I I h III hill :| I hill | : | | 
Db 38 ADPQRI PSNP - - SHRI QCAAGYEQSEHNVCQDI DECTAGTHNCRADQVCINLRGSFACQC 95 

Qy 131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189 

II Ihlllll II I I I llh I hill I : :| hill 

Db 96 PPGYQKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155 

Qy 190 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 249 

I I I I I Mllhh I I I I I ::| hill I :|lh:|lhll : I II 
Db 156 NQCAQQCYNI LGSF I CQCNQGYELSSDRLNCEDI DECRTSSYLCQYQCVNEPGKFSCMCP 215 

Qy 250 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 309 

M - Mil II : hi ||h| h:||: I 

Db 216 QGYQW- RSRTCQDI NECETTNE - CREDEMCWNYHGGFRCYPRNPCQDP YI LTPENRCVC 273 

Qy 310 PAENPGCRDQPFTI LYRDMDWSGRS VPADI FQMQATTRYPGAYYI FQI KSGNEGREFYM 369 

I I lh I = |='|: I : I 1 . I ! I | hllMII llh 

Db 274 PVSNAMCRELPQS I VYKYMSI RSDRSVPSDI FQI QATTI YANTI NTFRI KSGNENGEFYL 333, 

Qy 370 RQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423 

III hll lh : : I I I I 'MMI- || I I | | : | 
Db 334 RQTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTI I VGPFSF 387 



RESULT 3 
S78040 

fibulin, splice form C precursor - mouse
N;Alternate names: basement-membrane protein BM-90
C;Species: Mus musculus (house mouse)
C;Date: 24-Jul-1998 #sequence_revision 24-Jul-1998 #text_change 02-Aug-2002
C;Accession: S78040; S78560; S36440
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
Eur. J. Biochem. 215, 733-740, 1993
A;Title: Sequence of extracellular mouse protein BM-90/fibulin and its calcium-dependent
binding to other basement-membrane ligands.
A;Reference number: S34968; MUID:93358897; PMID:8354280
A;Accession: S78040
A;Molecule type: mRNA
A;Residues: 1-685 <PAN>
A;Cross-references: EMBL:X70854
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
submitted to the EMBL Data Library, January 1993
A;Description: Sequence of extracellular mouse protein BM-90/fibulin and its
calcium-dependent binding to other basement membrane ligands.
A;Reference number: S36440
A;Accession: S78560
A;Molecule type: mRNA
A;Residues: 1-39,'P',41-685 <CHU>
A;Cross-references: EMBL:X70854
C;Genetics:
A;Introns: 568/3
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; basement membrane; calcium binding;
extracellular matrix; glycoprotein; plasma
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-685/Product: fibulin, splice form C #status predicted <MAT>
F;98,537,541/Binding site: carbohydrate (Asn) (covalent) #status predicted



Query Match 30.9%; Score 744.5; DB 2; Length 685; 

Best Local Similarity 36.6%; Pred. No. 8.3e-42; 

Matches 159; Conservative 63; Mismatches 151; Indels 61; Gaps 16; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

II : I I I I hlhll = I I hi III 
Db 295 QCKSGFIQD-ALGNCI DINECLS I SAPCPVGQTCINTEGSYTC 336 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQM - DESNQCVDVDECATDSHQCNPTQI C 119 

= 11 I II : =1 ^ I I 1 I ! I I I : I I 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHHC 375 

Qy 120 INTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYSCTCNPGFTLN 174 

= 1= I = I I l = = : hlhll II III ||h hh II h 

Db 376 LNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFHCSCSAGFRLS 435 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231 

Mill Ml I I I Mh I I MM : Ml I Mi 

Db 436 VDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 494 

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

: I = hi I!- I I I I II I = hlllhll I I- Uhhll hh 

Db 495 ICSYRCINI PGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGSFRCL 554 

Qy 2 91 DPI RCEE P YLR I SDNRCM CPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATT 347 

I I I I M II I II I III: : I I I ': I : I ■ ■ 

Db 555 S-FECPENYRRSADTRCARLPC-HENQECPRLPLRITYYHLSFPTNIQVPAWFRMGPSS 612 

Qy . 348 RYPGAYY I FQI KSGNEGREF YMRQTGP I SATLVMTRP I KG PRE I QLDLEM - - I TVNTVI N 405 

II I MM I I : I : : I M I I- I I II : 

Db 613 AVPGDSMQLAITAGNEEGFFTTRKVSHHSGWALTKPI PEPRDLLLTVKMDLYRHGTVSS 672 



Qy 4 06 FRGSSVIRLRIYVS 419 

I I I hll 

Db 673 F VAKLFIFVS 682 



RESULT 4 
C36346 

fibulin 1 precursor, splice form C - human 
N;Alternate names: fibulin C 

N;Contains: fibulin 1 splice form A; fibulin 1 splice form C 
C;Species: Homo sapiens (man)
C;Date: 19-Apr-1991 #sequence_revision 19-Apr-1991 #text_change 02-Aug-2002
C;Accession: C36346; A36346; A32826
R;Argraves, W.S.; Tran, H.; Burgess, W.H.; Dickerson, K.
J. Cell Biol. 111, 3155-3164, 1990
A;Title: Fibulin is an extracellular matrix and plasma glycoprotein with
repeated domain structure.
A;Reference number: A36346; MUID:91100426; PMID:2269669
A;Accession: C36346
A;Molecule type: mRNA
A;Residues: 1-683 <ARG>
A;Cross-references: GB:X53743; NID:g31418; PIDN:CAA37772.1; PID:g31419
A;Accession: A36346
A;Molecule type: mRNA
A;Residues: 1-566 <AR2>
A;Cross-references: GB:X53741; NID:g31414; PIDN:CAA37770.1; PID:g31415
R;Argraves, W.S.; Dickerson, K.; Burgess, W.H.; Ruoslahti, E.
Cell 58, 623-629, 1989
A;Title: Fibulin, a novel protein that interacts with the fibronectin receptor
beta-subunit cytoplasmic domain.
A;Reference number: A32826; MUID:89354537; PMID:2527614
A;Accession: A32826
A;Molecule type: protein
A;Residues: 30-35,'SX',38-40,'SH',43-44 <AR3>
C;Genetics:
A;Gene: GDB:FBLN1; FBLN
A;Cross-references: GDB:278285; OMIM:135820
A;Map position: 22q13.3-22q13.3
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; glycoprotein
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-683/Product: fibulin 1 splice form C #status predicted <MAT>
F;180-214/Domain: EGF homology <EGF>

F;485-523/Domain: EGF homology <EGF1> 

F;98,535,539/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 30.6%; Score 735.5; DB 2; Length 683; 

Best Local Similarity 35.7%; Pred. No. 3.2e-41; 

Matches 155; Conservative 66; Mismatches 152; Indels 61; Gaps 16; 
Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

II I I I hi III 

Db 2 93 QCKSGF I QD - ALGNCI DI NECLS I SAPCP I GHTCI NTEGS YTC 334 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQM-DESNQCVDVDECATDSHQCNPTQI C 119 

Ml I II : :| :|||lllll : I I 

Db 335 QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRC 373 

Qy 120 INTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCNPGFTLN 174

Db 374 [illegible] 433

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231

Db 434 [illegible] 492

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290

Db 493 [illegible] 552

Qy 291 [illegible] 348

Db 553 -AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPAWFRMGPSSA 611

Qy 349 YPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRG 408

Db 612 [illegible] 666

Qy 409 SSVIRLRIYVS 419

Db 667 [illegible]


RESULT 5 
A49457 

fibulin-2 precursor - mouse 

C;Species: Mus musculus (house mouse)
C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 08-Sep-2002
C;Accession: A49457; S74095
R;Pan, T.C.; Sasaki, T.; Zhang, R.Z.; Faessler, R.; Timpl, R.; Chu, M.L.
J. Cell Biol. 123, 1269-1277, 1993
A;Title: Structure and expression of fibulin-2, a novel extracellular matrix
protein with multiple EGF-like repeats and consensus motifs for calcium binding.
A;Reference number: A49457; MUID:94064787; PMID:8245130
A;Accession: A49457
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-1221 <PAN>
A;Cross-references: GB:X75285; NID:g437046; PIDN:CAA53040.1; PID:g437047
R;Sasaki, T.; Mann, K.; Murphy, G.; Chu, M.L.; Timpl, R.
Eur. J. Biochem. 240, 427-434, 1996
A;Title: Different susceptibilities of fibulin-1 and fibulin-2 to cleavage by
matrix metalloproteinases and other tissue proteases.
A;Reference number: S74094; MUID:96439073; PMID:8841408
A;Accession: S74095
A;Molecule type: protein
A;Residues: 236-238,'X',240-247;260-275;336-344,'L',346-361;405-426;566-568,'EM',569-589;
653-666;784-787,'X',789-794;841-844,'X',846-850;883-892,'X',894-894;930-935,'X',937-939 <SAS>
C;Superfamily: fibulin-2; EGF homology; von Willebrand factor type C repeat
homology
C;Keywords: calcium binding; duplication; extracellular matrix; glycoprotein;
homotrimer
F;942-978/Domain: EGF homology <EGF>

Query Match 30.3%; Score 729.5; DB 2; Length 1221;

Best Local Similarity 35.3%; Pred. No. 1.3e-40; 

Matches 145; Conservative 59; Mismatches 148; Indels 59; Gaps 11; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

: I : I I I I - I I. III hi III 
Db 832 RCMDGF-LQDPEGNCVDINECTSLLEPCRSGFSCINTVGSYTC 873 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 



Db 874 [illegible]

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANVPGSYSCTCNPGFTLN 174

Db 913 YNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGFLLA 972

Qy 175 EDGRSCQDWECATENPCVQTCVNTYGSFI CRCDPGYELEEDGVHCSDMDECS - FSEFLC 233 

I I : hlllll I I I I I llh I I Ihl III hhllh : II 
Db 973 ADGKHCEDVNECETRR- CSQECANI YGS YQCYCRQGYQLAEDGHTCTDI DECAQGAGILC 1031 

Qy 234 QHECVNQPGTYFCSCP - PGYI LLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI DP 292 

III Ihl hll II - : llhl-ll I h Uhhll hh 

Db 1032 TFRCraVPGSYQCACPEQGYTMMANGRSCKDLDEC^LGTHNCSEAETCHNIQGSFRCL-R 1090 

Qy 293 IRCEEPYLRISDNRCMCPAENPGCRD QPFTI LYRDMDWSGRSVPADI FQMQAT 346 

I hhl =1 Ihl I I : - :| III II- 

Db 1091 FDCPPNYVRVSQTKC ERTTCQD I TECQTS PAR I TH YQLNFQTGLLVPAH I FR I G PA 1146 

Qy 347 TRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGPREIQLDLEM 397 

= 1 I III I h : s : | : ||: ||:|| 

Db 1147 PAFAGDTI SLTI TKGNEEGYFVTRRLNAYTGWSLQRSVLEPRDFALDVEM 1197 



RESULT 6 
S34968 

fibulin, splice form D precursor - mouse 

N;Alternate names: basement-membrane protein BM-90; calcium-binding protein BM-90
C;Species: Mus musculus (house mouse)
C;Date: 10-Dec-1993 #sequence_revision 10-Nov-1995 #text_change 02-Aug-2002
C;Accession: S34968; S36441; S13814
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
Eur. J. Biochem. 215, 733-740, 1993
A;Title: Sequence of extracellular mouse protein BM-90/fibulin and its calcium-dependent
binding to other basement-membrane ligands.
A;Reference number: S34968; MUID:93358897; PMID:8354280
A;Accession: S34968
A;Molecule type: mRNA
A;Residues: 1-705 <PAN>
R;Pan, T.C.; Kluge, M.; Zhang, R.Z.; Mayer, U.; Timpl, R.; Chu, M.L.
submitted to the EMBL Data Library, January 1993
A;Description: Sequence of extracellular mouse protein BM-90/fibulin and its
calcium-dependent binding to other basement membrane ligands.
A;Reference number: S36440
A;Accession: S36441
A;Molecule type: mRNA
A;Residues: 1-39,'P',41-705 <PAW>
A;Cross-references: EMBL:X70854; NID:g396820; PIDN:CAA50207.1; PID:g396821
A;Experimental source: cell-line F9 teratocarcinoma
R;Kluge, M.; Mann, K.; Dziadek, M.; Timpl, R.
Eur. J. Biochem. 193, 651-659, 1990
A;Title: Characterization of a novel calcium-binding 90-kDa glycoprotein (BM-90)
shared by basement membranes and serum.
A;Reference number: S13814; MUID:91065369; PMID:2249686
A;Accession: S13814
A;Molecule type: protein
A;Residues: 28;31-49,'X',51-53;'XX',110-117;231-240,'X',242-243;339-362,'S',364-387;
434-439;469-476;'Q',554-557,'Q',559-563;574-581 <KLU>
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; basement membrane; calcium binding;
extracellular matrix; glycoprotein; plasma
F;1-29/Domain: signal sequence #status predicted <SIG>
F;30-705/Product: fibulin, splice form D #status predicted <MAT>
F;98,537,541/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 29.2%; Score 702; DB 2; Length 705; 

Best Local Similarity 35.2%; Pred. No. 5.4e-39; 

Matches 160; Conservative 66; Mismatches 154; Indels 74; Gaps 21; 

Qy 1 QCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

II ' I I hlhll : I I hi Ml 
Db 295 QCKSGF I QD - ALGNCI DI NECLS I SAPCPVGQTC I NTEGS YTC 336 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 

Ml I II : :| :|||||||| : I | 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHHC 375 

Qy 120 INTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYSCTCNPGFTLN 174 

= 1= I = I I h= = hlhll II III llh hh II h 

Db 376 LNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFHCSCSAGFRLS 435 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231 

MllhlMM :|| I I I llh I I Ihl : III I hllh 
Db 436 vr>GRSCEDWEC-LNSPCSQEO^VYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 4 94 

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

U = hi I MM II I : hlllhll I h: Mhhll hh 

Db 495 ICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGSFRCL 554 

Qy 291 DPIRCEEPYLRISDN RCM- -CPAENPGC-RDQPFTILYRDMDWSGRSV- - 336 

I I I I =1 Ihl : I II |= : : = : | 

Db 555 S - FECPENYRRSADTFRQEKTDTVRCI KSCRPNDEACVRDPVHT VSHTVI SLPTFREFTR 613 

Qy 337 PAD I FQMQATT-RYPG- -AYYI FQI KSGNEGREFYM RQTGP I SATLVMTRP I KGPRE 390 

I = 1 :H I II I II I II I : * I : Ml II 

Db 614 PEEI IFLRAVTPLYPANQADI IFDITEGNLRDSFDI IKRYEDGMTVGWRQVRPIVGPFY 673 

Qy 391 I QLDLEM - I TVNTVI NFRGSSVI RLR I YVSQYPF 423 

I III : h: I :|« = hi hi I 
Db 674 AVLKLEMNYVLGGWSHR - -NWNVHI FVSEYWF 705 



RESULT 7 
A55184 

fibulin-2 precursor - human 

N;Alternate names: protein DKFZp586A1519.1
C;Species: Homo sapiens (man)
C;Date: 27-Jan-1995 #sequence_revision 27-Jan-1995 #text_change 08-Sep-2002
C;Accession: A55184; T08744
R;Zhang, R.Z.; Pan, T.C.; Zhang, Z.Y.; Mattei, M.G.; Timpl, R.; Chu, M.L.
Genomics 22, 425-430, 1994
A;Title: Fibulin-2 (FBLN2): human cDNA sequence, mRNA expression, and mapping of
the gene on human and mouse chromosomes.
A;Reference number: A55184; MUID:95104855; PMID:7806230
A;Accession: A55184
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-1184 <ZHA>
A;Cross-references: GB:X82494; NID:g575232; PIDN:CAA57876.1; PID:g575233
R;Wambutt, R.; Heubner, D.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S.
submitted to the Protein Sequence Database, March 1999
A;Reference number: Z16471
A;Accession: T08744
A;Molecule type: mRNA
A;Residues: 656-719,'QDECLMGAHDCSRRQFCWTLGSFYCVNHTVLCADGYILNAHRKCVD',720-853,'I',855-1184 <WAM>
A;Cross-references: EMBL:AL050095
A;Experimental source: adult uterus; clone DKFZp586A1519
C;Genetics:
A;Gene: GDB:FBLN2
A;Cross-references: GDB:293037; OMIM:135821
A;Map position: 3p25-3p24
A;Note: DKFZp586A1519.1
C;Superfamily: fibulin-2; EGF homology; von Willebrand factor type C repeat
homology
C;Keywords: alternative splicing; extracellular matrix
F;1-27/Domain: signal sequence #status predicted <SIG>
F;28-1184/Product: fibulin-2 protein #status predicted <MAT>
F;905-941/Domain: EGF homology <EGF>

Query Match 29.1%; Score 701.5; DB 2; Length 1184; 

Best Local Similarity 34.5%; Pred. No. 9.1e-39; 

Matches 142; Conservative 58; Mismatches 152; Indels 59; Gaps 11; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60. 

: I I I ■ -I :: Ml hi Ml 
Db 795 RCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC 836 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVDECATDSHQCNPTQIC 119

Db 837 [illegible]

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANVPGSYSCTCNPGFTLN 174

I I I I I h 1 = 1=11 II I I III hi II I

Db 876 HNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENTLGSYRCSCASGFLLA 935

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

Ih hi : I I I I Mh M Ih Ml I : I : I I I : : II

Db 936 ADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 994

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

I - l M*l I • II II * • 111*1-11 I | • : I I : I : I I i : I :

Db 995 TFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGSFRCL-R 1053

Qy 293 IRCEEPYLRISDNRCMCPAENPGCRD QPFTILYRDMDWSGRSVPADIFQMQAT 346

I h-l =1 III I I : := =| III Ih:

Db 1054 FECPPNYVQVSKTKC ERTTCHDFLECQNSPARITHYQLNFQTGLLVPAHIFRIGPA 1109

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397

• I I III I I * • I II- 11*11

Db 1110 PAFTGDTIALNIIKGNEEGYFGTRRLNAYTGWYLQRAVLEPRDFALDVEM 116




RESULT 8 
B36346 



fibulin 1 precursor, splice form B - human 
C;Species: Homo sapiens (man)
C;Date: 19-Apr-1991 #sequence_revision 19-Apr-1991 #text_change 02-Aug-2002
C;Accession: B36346
R;Argraves, W.S.; Tran, H.; Burgess, W.H.; Dickerson, K.
J. Cell Biol. 111, 3155-3164, 1990
A;Title: Fibulin is an extracellular matrix and plasma glycoprotein with
repeated domain structure.
A;Reference number: A36346; MUID:91100426; PMID:2269669
A;Accession: B36346
A;Molecule type: mRNA
A;Residues: 1-601 <ARG>
A;Cross-references: GB:X53742; NID:g31416; PIDN:CAA37771.1; PID:g31417
C;Genetics:
A;Gene: GDB:FBLN1; FBLN
A;Cross-references: GDB:278285; OMIM:135820
A;Map position: 22q13.3-22q13.3
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing
F;180-214/Domain: EGF homology <EGF1>
F;485-523/Domain: EGF homology <EGF> 

Query Match 25.7%; Score 617.5; DB 2; Length 601; 

Best Local Similarity 38.3%; Pred. No. 1.7e-33; 

Matches 119; Conservative 46; Mismatches 95; Indels 51; Gaps 12; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60, 

II hi I I hlhll :| I hi Ml 
Db 293 QCKSGF I QD - ALGNCI D I NECLS I SAPCP I GHTC I NTEGS YTC 334 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 

I I ' ! I! : | | 

Db 335 QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRC 373 

Qy 12 0 INTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYSCTCNPGFTLN 174 

H: I : I I Ih : hl-ll || II ||| |:|: I h 

Db 374 VNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYLCSCSVGFRLS 433 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFI CRCDPGYELEE - DGVHCSDMDECSF - - SEF 231 

i I I M : 1 : I I I : : HI I I I Ih I I Ih : I | hllh 

Db 434 VDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 492 

Qy 232 LCQHECVNQPGTYFCSCP - PGYI LLDDNRSCQDI NECEHRNHTCNLQQTCYNLQGGFKCI 290 

■■\ ■■ hi Ih: I I I I II I : MM | |:: :||:|:|| hh 

Db 493 I CS YRCINI PGSFQCSCPSSGYRLAPNGRN CQDI DECVTGI HNCS I NETCFNI QGAFRCL 552 

Qy 291 DPIRCEEPYLR 301

I I I I 

Db 553 -AFECPENYRR 562 
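Where the match line between a Qy/Db pair is hard to read, the identical positions can be recounted directly from the two aligned strings. The sketch below does this for the short final block of RESULT 8; `count_columns` is a hypothetical helper, and it treats `-` as the gap character, as the alignments in this report appear to do:

```python
def count_columns(qy: str, db: str, gap: str = "-") -> dict:
    """Classify the columns of two equal-length aligned sequences."""
    if len(qy) != len(db):
        raise ValueError("aligned sequences must have the same length")
    # A column is identical when both residues agree and neither is a gap.
    identical = sum(1 for a, b in zip(qy, db) if a == b and a != gap)
    # A column is gapped when either side holds the gap character.
    gapped = sum(1 for a, b in zip(qy, db) if a == gap or b == gap)
    return {
        "identical": identical,
        "gapped": gapped,
        "other": len(qy) - identical - gapped,
    }


# Final alignment block of RESULT 8 (Qy 291-301 against Db 553-562).
print(count_columns("DPIRCEEPYLR", "-AFECPENYRR"))
```

The "other" bucket lumps conservative substitutions and mismatches together, because telling them apart would need the BLOSUM62 scoring table.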



RESULT 9 
T42760 

fibulin, splice form D precursor - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 23-Sep-2002
C;Accession: T42760
R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, February 1998
A;Description: Isolation of chicken and nematode fibulin-1 homologs and
characterization of the nematode fibulin-1 gene.
A;Reference number: Z22267
A;Accession: T42760
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-689 <BAR>
A;Cross-references: EMBL:AF051401; PIDN:AAC28321.1
C;Genetics:
A;Note: FBLN1

Query Match 25.1%; Score 603; DB 2; Length 689; 

Best Local Similarity 27.4%; Pred. No. 1.7e-32; 

Matches 156; Conservative 52; Mismatches 155; Indels 206; Gaps 19; 

Qy 2 CTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYST 61 

I Mill hlMM |: : | hi | : II 
Db 180 CRSGFDLAPDGMACVDIDECATLMDDCLESQRCLNTPGSFKCI 222 

Qy 62 PYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD-ESNQCVDVDECATDSHQCNPTQICI 12 0 

III II II h M Mill II I I I 
Db 223 RTLSCGTGYAMDSETERCRDVDECNLGSHDCGPLYQCR 260 

Qy 121 NTEGGYTC SCTDGYWLLEGQCLDI DECRYGY 151 

m i I I : I Mh I I mil i 

Db 261 NTQGSYRCDAKKCGDGELQNPMTGECTS I TCPNGYYPKNGMCNDI DECVTGHNCGAGEEC 320 

Qy 152 CQQ LCA NVPGSYSCTCNPGF 171 

Ml III hlhl I I lh 

Db 321 VNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECINLPGTYKCKCGPGY 380 

Qy 172 TLNE DGRSCQDVNECA 187 

I : III 1 = 11111 

Db 381 EFNDAKKRCEDVDECIKFAGHVCDLSAECINTIGSFECKCKPGFQLASDGRRCEDVNECT 44 0 

Qy 188 TE -NPCVQTCVNTYGSFI CRCDPGYELEEDGVHCSDMDECSF SEFLCQHECVNQPG 242 

I I I III lh III II I h II || hi | 

Db 441 TGIAACEQKCVNI PGSYQCICDRGFALGPDGTKCEDIDECSIWAGSGNDLCMGGCINTKG 500 

Qy 243 TYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRI 302 

: I I Mill : I Ml ' : I I I III II 

Db 501 S YLCQCPPGYKI QPDGRTCVDVDECA - MGECAGSDKVCVNTLGSFKC - HS I DCPTNY I HD 558 

Qy 303 SDNR C- -MCPAENPGC-RDQPFTILYRDMDWSGRSV- -PADI 340 

I h hi h h Oh II::: I - 

Db 559 SLNKNQIADGYSCIKVCSTEDTECLGNHTREVLYQFRAVPSLKTIISPIEVSRIVTHMGV 618 

Qy 341 -FQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMIT 399 

I : | | : :: | | =11 II Ml 
Db 619 P FS VD YNLD YVGQRHFR I VQERN I G IVQLVKPISGP TVETIK 660 

Qy 400 VNTVINFRGSSVIR LRIYVSQYPF 423 

II I : I hi 

Db 661 VNI HTKSRTGVI LAFNEAI I EI SVSKYPF 689 



RESULT 10 
T42990 

fibulin 1, splice form C precursor - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 08-Sep-2002
C;Accession: T42990
R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, February 1998
A;Description: Isolation of chicken and nematode fibulin-1 homologs and
characterization of the nematode fibulin-1 gene.
A;Reference number: Z22267
A;Accession: T42990
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-712 <BAR>
A;Cross-references: EMBL:AF051402; PIDN:AAC28322.1
C;Genetics:
A;Gene: FBLN1
C;Superfamily: fibulin-1; EGF homology
C;Keywords: alternative splicing; basement membrane; extracellular matrix

Query Match 24.8%; Score 596; DB 2; Length 712; 

Best Local Similarity 27.9%; Pred. No. 5.1e-32; 

Matches 152; Conservative 46; Mismatches 145; Indels 202; Gaps 18; 
Qy 2 CTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYST 61 

I Ml li I hi M II 

Db 180 CRSGFDLAPDGMACVDIDECATLMDDCLESQRCLNTPGSFKCI 222 

Qy 62 PYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD-ESNQCVDVDECATDSHQCNPTQICI 120 

III II II h =1 Mill " II I I I 

Db 223 RTLSCGTGYAMDSETERCRDVDECNLGSHDCGPLYQCR 260 

Qy 121 NTEGGYTC SCTDGYWLLEGQCLDI DECRYGY 151 

' I I :| | | Mill ' 

Db 261 NTQGSYRCDAKKCGDGELQNPMTGECTSITCPNGYYPKNGMCNDIDECVTGHNCGAGEEC 320 

Qy 152 CQQ LCA NVPGSYSCTCNPGF 171 

IN Ml hlhl I I Ih 

Db 321 VNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECINLPGTYKCKCGPGY 380 

Qy 172 TLNE DGRSCQDVNECA 187 

M III ; ,|| 

Db 381 EFNDAKKRCEDVDECIKFAGHVCDLSAECINTIGSFECKCKPGFQLASDGRRCEDVNECT 44 0 

Qy 188 TE-NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF SEFLCQHECVNQPG 242 

I Mill Ih III h I II I MINI II hi I' 

Db 441 TGIAACEQKCVNI PGSYQCICDRGFALGPDGTKCEDIDECSIWAGSGNDLCMGGCINTKG 500 

Qy 243 TYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRI 302 

= 1 I Mill : I hi h:|| II |: 

Db 501 SYLCQCPPGYKIQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFKC-HSIDCPTNYIHD 558 

Qy 303 SDNR CMCPAENPGCRDQPFTI LYRDMDWSGRSVP 337 

I 1= III I I = | : : : |:|| 
Db 559 SLNKNRCNRQPSACGLPEE CSKVPLFLTYQFISL- -ARAVPISSHRPAITLFKVSAP 613 



Qy 338 --ADI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGP- - 1 SATLVMTRPI KGPRE 390 

II h = | III: I - I I II = = : Ih 

Db 614 NHADTEVNFELQLKTTIVGAPNVLPAIRAN FLLQKGEKRNSAWTLRDSLDGPQT 668 

Qy 391 IQLDL 3 95 

-I I 

Db 669 VKLQL 673 



RESULT 11 
T43210 

fibulin-1D precursor - Caenorhabditis elegans (fragment)
C;Species: Caenorhabditis elegans
C;Date: 11-Jan-2000 #sequence_revision 11-Jan-2000 #text_change 08-Sep-2002
C;Accession: T43210
R;Barth, J.L.; Argraves, K.M.; Roark, E.F.; Little, C.D.; Argraves, W.S.
submitted to the EMBL Data Library, June 1998
A;Description: Identification of chicken and C. elegans fibulin-1 homologs and
characterization of the C. elegans fibulin-1 gene.
A;Reference number: Z22337
A;Accession: T43210
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-589 <BAR>
A;Cross-references: EMBL:AF070477; PIDN:AAC24035.1
C;Genetics:
A;Note: intron positions not resolved (incomplete sequence)
C;Superfamily: fibulin-1; EGF homology

Query Match 24.6%; Score 592.5; DB 2; Length 589; 

Best Local Similarity 27.4%; Pred. No. 7.4e-32; 

Matches 156; Conservative 52; Mismatches 155; Indels 207; Gaps 20; 



Qy 2 CTNGFDLDRQSGQCLD- I DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

I Ml hi I I I I h : I hi I : II 

Db 79 CRSGFDLAPDGMACVDHIDECATLMDDCLESQRCLNTPGSFKCI 122 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMD- ESNQCVD VDECATDSHQCNPTQI C 119 

III II II h I lllll II I I I 
Db 123 RTLSCGTGYAMDSETERCRDVDECNLGSHDCGPLYQC 159 

Qy 12 0 INTEGGYTC S CTDG YWLLEGQCLD I DECRYGY 151 

I I I I :| :|h I I ||||| 

Db 160 RNTQGSYRCDAKKCGDGELQNPMTGECTSITCPNGYYPKNGMCNDIDECVTGHNCGAGEE 219 

Qy 152 CQQ LCA NVPGSYSCTCNPG 170 

Ml Ml hlhl I I I I 

Db 220 CVNTPGSFRCQQKGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECINLPGTYKCKCGPG 279 

Qy 171 FTLNE DGRSCQDVNEC 186 

s 1 III hlllll 

Db 280 YEFNDAKKRCEDVDECIKFAGHVCDLSAECINTIGSFECKCKPGFQLASDGRRCEDVNEC 339 

Qy 187 ATE-NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSF SEFLCQHECVNQP 241 

I Mill Ih I II h I II I hill I I hi 

Db 340 TTGI AACEQKCVNI PGS YQCI CDRGFALGPDGTKCEDI DECS I WAGSGNDLCMGGCI NTK 399 



Qy 242 GTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLR 301 

hi I I I II I : I hi h:|| : I I I III II I 

Db 4 00 GSYLCQCPPGYKIQPDGRTC^VDEC^-MGECAGSDKVCVNTLGSFKC-HSIDCPTNYIH 457 

Qy 3 02 ISDNR C- -MCPAENPGC-RDQPFTI LYRDMDWSGRSV- - PAD I 340 

| |= | :| |: | : :||: | | ::: | :: 

Db 458 DSLNKNQIADGYSCI KVCSTEDTECLiGNHTREVLYQFRAVPSLKTI I SPI EVSRI VTHMG 517 

Qy 341 --FQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMI 398 

I = | | : :: | | =11 II : I I 

Db 518 VPFS VDYNLDYVGQRHFRI VQERN I G IVQLVKPISGP TVETI 559 

Qy 3 99 TVNTVI NFRGSSVI R LRIYVSQYPF 423 

II I : I Ihlll 

Db 560 KVNIHTKSRTGVILAFNEAI IEISVSKYPF 589 



RESULT 12 
T22793 

hypothetical protein F56H11.1 - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 02-Aug-2002
C;Accession: T22793; T24489
R;Wilkinson, J.
submitted to the EMBL Data Library, January 1996
A;Reference number: Z19616
A;Accession: T22793
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-798 <WIL>
A;Cross-references: EMBL:Z68749; PIDN:CAA92962.1; GSPDB:GN00022; CESP:F56H11.1
A;Experimental source: clone F56H11
R;Lloyd, C.
submitted to the EMBL Data Library, December 1995
A;Reference number: Z19897
A;Accession: T24489
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-798 <WI2>
A;Cross-references: EMBL:Z68219; PIDN:CAA92483.1; GSPDB:GN00022; CESP:F56H11.1
A;Experimental source: clone T05A1
C;Genetics:
A;Gene: CESP:F56H11.1
A;Map position: 4
A;Introns: 14/1; 92/3; 144/1; 195/1; 281/1; 325/2; 371/2; 390/1; 437/1; 471/3;
498/3; 607/1; 649/2; 718/1
C;Superfamily: fibulin-1; EGF homology

Query Match 23.8%; Score 574; DB 2; Length 798; 

Best Local Similarity 26.8%; Pred. No. 1.6e-30; 

Matches 159; Conservative 49; Mismatches 172; Indels 214; Gaps 22; 

Qy 2 CTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLC IP RTNPVY 51 

I =1111 hi :|| I I Ml |||: | :| | | : 

Db 180 CRSGFDLAPDGMACVDRNECLTRQSPCTQSEDCVNTIGGYICQRRISRLVPHRHRANRIG 239 



Qy 52 RGP YSNPYSTPYSGPYPAAAPP 73 

I M H I h 

Db 240 NAPRRMRDDPYSR- -AGEYREASQANTEFGCPMGWLFQHGHCVDI DECATLMDDCLESQR 297 

Qy 74 -LSAPNYPTI SRPLI CRFGYQMD ESNQCV DVDECATDSHQC 113 

hi I I I II II III Mill II I 

Db 298 CLNTPGSFKCI RTLSCGTGYAMDSETERNNCFLI I LNNTFNCKYFFVEDVDECNLGSHDC 357 

Qy 114 NPTQ I C I NTEGG YTCS 129 

I I Ihl I I 

Db 358 GPLYQCRNTQGSYRCDAKKCGDGELQNPMTGEYIDECVTGHNCGAGEECVNTPGSFRCQQ 417 

Qy 130 CTDGYWL- - LEGQCLDI DECRYGYCQQL- CANVPGS YSCTCNPGFTLNE 175 

I II I | | : Ml | | I h 

Db 418 KGNLCAHGYEVNGATGFCEDVNECQQGVCGSMECINLPGTYKCKCGPGYEFNDAKKRCED 4 77 

Qy 176 DGRSCQDVNECATE-NPCVQTCV 197 

III hlllll I I I II 

Db 478 VDECI KFAGHVCDLSAECI NTI GSFECKCKPGFQLASDGRRCEDVNECTTGI AACEQKCV 537 

Qy 198 NTYGSFI CRCDPGYELEEDGVHCSDMDECSF SEFLCQHECVNQPGTYFCSCPPGYI 253 

I Ih I II h I II I Mill II hi hi I HIM 

Db 538 NI PGS YQCI CDRGFALGPDGTKCEDI DECS I WAGSGNDLCMGGCI NTKGS YLCQCPPGYK 597 

Qy 254 LLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNR 306 

: I hi I = II I III II h I h 

Db 598 IQPDGRTCVDVDECA-MGECAGSDKVCVNTLGSFKC-HSIDCPTNYIHDSLNKNRCNRQP 655 

Qy 307 - - CMCPAENPGCRDQPFTI LYRDMDWSGRSVP ADI FQM 343 

III I I : I : : : |:|| || |:: 

Db 656 SACGLPEE CSKVPLFLTYQFISL- -ARAVPISSHRPAITLFKVSAPNHADTEVNFEL 710 

Qy 344 QATTRYPGAYYI FQI KSGNEGREFYMRQTGP - - I SATLVMTRPI KGPREI QLDL 395 

I I lh I :: I I || : : : ||: ::| | 

Db 711 QLKTTIVGAPNVLPAIRAN FLLQKGEKRNSAWTLRDSLDGPQTVKLQL 759 



RESULT 13 
A35626 

transforming growth factor beta-1-binding protein - human
C;Species: Homo sapiens (man)
C;Date: 21-Sep-1990 #sequence_revision 21-Sep-1990 #text_change 11-Jan-2000
C;Accession: A35626
R;Kanzaki, T.; Olofsson, A.; Moren, A.; Wernstedt, C.; Hellman, U.; Miyazono,
K.; Claesson-Welsh, L.; Heldin, C.H.
Cell 61, 1051-1061, 1990
A;Title: TGF-beta1 binding protein: a component of the large latent complex of
TGF-beta1 with multiple repeat sequences.
A;Reference number: A35626; MUID:90275601; PMID:2350783
A;Accession: A35626
A;Status: preliminary; not compared with conceptual translation
A;Molecule type: mRNA
A;Residues: 1-1394 <KAN>
A;Cross-references: GB:M34057; NID:g339547; PIDN:AAA61160.1; PID:g339548
C;Superfamily: unassigned EGF-related proteins; EGF homology
C;Keywords: alternative splicing
F;750-791/Domain: EGF homology <EGF>



Query Match 22.4%; Score 539; DB 2; Length 1394; 

Best Local Similarity 29.6%; Pred. No. 5.3e-28; 

Matches 126; Conservative 51; Mismatches 126; Indels 122; Gaps 17; 



Qy 2 CTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYST 61

I h I :|:|MII : I II I :|||

Db 573 CYEGYRFSEQQRKCVDIDECTQVQHLC-SQGRCENTEGSFLCI 614

Qy 62 PYSGPYPAAAPPLSAPNYPTISRPLI CRFGYQMDESNQCVDVD 104

: 1= : = || = | ||:| : :| H

Db 615 -CPAGFMASEEGTNCIDVDECLRPDVCGEGHCVNTVGAFRCEYCDSGYRMTQRGRCEDID 673

Qy 105 ECATDSHQCNPTQICINTEGGYTC-SCTDGYWLLEGQCLDIDEC-RYGYCQQ-LCANVPG 161

II I I I : hh I I I Ihh llllhlll I hh I

Db 674 ECLNPS-TC-PDEQCVNSPGSYQCVPCTEGFRGWNGQCLDVDECLEPNVCANGDCSNLEG 731

Qy 162 SYSCTCNPGFTLNEDGRSCQDVNECATENPCVQ 194

II hh hi I : hl-ll I II

Db 732 SYMCSCHKGYTRTPDHKHCRDIDECQQGNLCVNGQCKNTEGSFRCTCGQGYQLSAAKDQC 791

Qy 195 TCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQH-ECV 238

I II III I II II I II |::|| : : I I !

Db 792 EDIDECQHRHLCAHGQCRNTEGSFQCVCDQGYRASGLGDHCEDINECLEDKSVCQRGDCI 851

Qy 239 NQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290

I hi Ml h lllh:||||||||| I I I I : I I I :

Db 852 NTAGSYDCTCPDGF-QLDDNKTCQDINECEHPG-LCGPQGECLNTEGSFHCVCQQGFSIS 909

Qy 291 -DPIRCEEPYLRIS DN RCMC PAENPGCRDQPFTI LYRDM 328

I I - II Ihl I : II I

Db 910 ADGRTCEDIDECVNNTVCDSHGFCDNTAGSFRCLCYQGFQAPQDGQGCVD VNEC 963

Qy 329 DVVSG 333

:::||

Db 964 ELLSG 968



RESULT 14 
A47221 

fibrillin 1 precursor - human (fragment) 
C;Species: Homo sapiens (man)
C;Date: 02-Jun-1995 #sequence_revision 25-Apr-1997 #text_change 02-Aug-2002
C;Accession: A47221; I54355; S17064; I59574; S17062; S62111; A34198
R;Corson, G.M.; Chalberg, S.C.; Dietz, H.C.; Charbonneau, N.L.; Sakai, L.Y.
Genomics 17, 476-484, 1993
A;Title: Fibrillin binds calcium and is coded by cDNAs that reveal a multidomain
structure and alternatively spliced exons at the 5' end.
A;Reference number: A47221; MUID:94010947; PMID:7691719
A;Accession: A47221
A;Molecule type: mRNA
A;Residues: 1-337,'T',339-1029 <COR>
A;Cross-references: GB:X63556
R;Pereira, L.V.; D'Alessio, M.; Ramirez, F.; Lynch, J.; Sykes, B.; Pangilinan,
T.; Bonadio, J.
Hum. Mol. Genet. 2, 961-968, 1993
A;Title: Genomic organization of the sequence coding for fibrillin, the
defective gene product in Marfan syndrome.
A;Reference number: I54355; MUID:93372860; PMID:8364578
A;Accession: I54355
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 132-3002 <PER>
A;Cross-references: GB:L13923; NID:g306745; PIDN:AAB02036.1; PID:g306746
R;Maslen, C.L.; Corson, G.M.; Maddox, B.K.; Glanville, R.W.; Sakai, L.Y.
Nature 352, 334-337, 1991
A;Title: Partial sequence of a candidate gene for the Marfan syndrome.
A;Reference number: S17064; MUID:91304568; PMID:1852207
A;Accession: S17064
A;Molecule type: mRNA
A;Residues: 1030-3002 <MAS>
A;Cross-references: EMBL:X63556
R;Dietz, H.C.; Valle, D.; Francomano, C.A.; Kendzior, R.J.
Science 259, 680-683, 1993
A;Title: The skipping of constitutive exons in vivo induced by nonsense
mutations.
A;Reference number: I59574; MUID:93157831; PMID:8430317
A;Accession: I59574
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA

A;Residues: 2217-2288 , 1 1 * , 2290-2325 <RES> 

A;Cross-references: GB:S54426; NID:g264860; PIDN:AAB25244.1; PID:g264861
R;Lee, B.; Godfrey, M.; Vitale, E.; Hori, H.; Mattei, M.G.; Sarfarazi, M.;
Tsipouras, P.; Ramirez, F.; Hollister, D.W.
Nature 352, 330-334, 1991
A;Title: Linkage of Marfan syndrome and a phenotypically related disorder to two
different fibrillin genes.
A;Reference number: S17062; MUID:91304567; PMID:1852206
A;Accession: S17062
A;Molecule type: mRNA
A;Residues: 'VLVTWFIFLSYNKML',944-1444 <LEE1>
A;Cross-references: EMBL:X62008; NID:g31398; PIDN:CAB56534.1; PID:g5924015
A;Accession: S62111
A;Molecule type: protein
A;Residues: 1166-1176,'X',1178-1180,'D',1182-1185 <LEE2>
R;Maddox, B.K.; Sakai, L.Y.; Keene, D.R.; Glanville, R.W.
J. Biol. Chem. 264, 21381-21385, 1989
A;Title: Connective tissue microfibrils. Isolation and characterization of three
large pepsin-resistant domains of fibrillin.
A;Reference number: A34198; MUID:90078246; PMID:2512293
A;Accession: A34198
A;Molecule type: protein

A; Residues: 565-575; 1890-18 92 , » I 1 , 1894-1900 <MAD> 

C;Comment: Fibrillin is a major component of elastin-associated microfibrils.
C;Genetics:
A;Gene: GDB:FBN1
A;Cross-references: GDB:127115; OMIM:134797; OMIM:154700
A;Map position: 15q21.1-15q21.1
A;Introns: 2236/1; 2258/1; 2297/1
C;Superfamily: fibrillin 1; EGF homology
C;Keywords: alternative splicing; calcium binding; extracellular matrix;
glycoprotein; Marfan syndrome
F;1-3002/Product: fibrillin (5'-region exon A splice form) (fragment) #status
predicted <MATA>
F;132-3002/Product: fibrillin (5'-region exon C splice form) #status predicted
<MATC>
F;1332-1367/Domain: EGF homology <EGF>
F;1457-1492/Domain: EGF homology <EGF2>
F;2262-2295/Domain: EGF homology <EGF1>

Query Match 21.7%; Score 523.5; DB 2; Length 3002; 

Best Local Similarity 30.3%; Pred. No. 1.1e-26;

Matches 122; Conservative 47; Mismatches 120; Indels 113; Gaps 18; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

H : I I II = I I 1 I I I I h I I Ml | : | : : | 
Db 1185 RCDSGFALDSEERNCTDIDECRISPDLC-GRGQCVNTPGDFEC- -KCDEGYESGF 1236 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISR PLICRF GYQMDES-NQ 99 

= I I Ihll 1=1= : = 

Db 1237 MMMKNCMDI DECQRDPLLCRGGVCHNTEGS YRCECPPGHQLS PNI SA 1283 

Qy 100 CVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWLLEGQ- -CLDIDECRY- -GYCQQL 155 

: ! :| I I 1 = 111 hi || : II | 

Db 1284 CI DI NECELSAHLC - PNGRCVNL I GKYQCACNPG YHSTPDRLFCVD I DECS I MNGGCETF 1342 

Qy 156 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENP 191 

I I I I I hi III I I Ml l = = ll 
Db 1343 CTNSEGSYECSCQPGFALMPDQRSCTDIDEC-EDNPNICDGGQCTNIPGEYRCLCYDGFM 1401 

Qy 192 -- CVQ TCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSE 230 

II II II Mill II II == hhMI 

Db 1402 AS EDMKTCVDVNECDLN PN I CLSGTCENTKGS F I CHCDMG YSGKKGKTGCTD I NE CE I GA 1461 

Qy 231 FLC-QHE-CVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFK 288 

I : I M h= Ml || = | I I |::|| = | h III:: 
Db 1462 HNCGKHAVCTNTAGSFKCSCSPGWI - -GDGIKCTDLDECSNGTHMCSQHADCKNTMGSYR 1519 

Qy 289 CIDPIRCEEPYL RISDNRCMCPAENPGCRDQP 320 

• h hi I hi :| I I : I 

Db 1520 CL CKEGYTGDGFTCTDLDECSENLNLC - - GNGQCLNAP 1555 



RESULT 15 
A55567 

fibrillin I - bovine 

C;Species: Bos primigenius taurus (cattle)
C;Date: 20-Feb-1995 #sequence_revision 20-Feb-1995 #text_change 02-Aug-2002
C;Accession: A55567
R;Tilstra, D.J.; Li, L.; Potter, K.A.; Womack, J.; Byers, P.H.
Genomics 23, 480-485, 1994
A;Title: Sequence of the coding region of the bovine fibrillin cDNA and
localization to bovine chromosome 10.
A;Reference number: A55567; MUID:95137597; PMID:7835900
A;Accession: A55567
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-2871 <TIL>
A;Cross-references: GB:L28748; NID:g508427; PIDN:AAA74122.1; PID:g508428
C;Superfamily: fibrillin 1; EGF homology
F;1201-1236/Domain: EGF homology <EGF>



Query Match 21.7%; Score 522.5; DB 2; Length 2871; 

Best Local Similarity 29.9%; Pred. No. 1.2e-26; 

Matches 120; Conservative 45; Mismatches 124; Indels 113; Gaps 16; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

= 1 = 11 II = II I I I II | : | | ||| | : | : = | : 
Db 1054 RCDSGFALDS EERNCTD I DECRI S PDLC -GRGQCVNTPGDFEC - - KCDEGYESGF 1105 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISR PLI CRFGYQMDES NQ 99 

= I I 11 = 11 I = = 

Db 1106 MMMKNCMDI DECQRDPLLCRGGVCLNTEGS YRCECPPGHQLAPNI SA 1152 

Qy 100 CVDVDECATDSHQCNPTQI CINTEGGYTCSCTDGYWLLEGQ- - CLDI DECRY- -GYCQQL 155 

l = h = ll =1 II 1 = 1 I I hi II : hlMM I h 

Db 1153 CIDINECELSAHLC-PHGRCVNLIGKYQCACNPGYHSTPDRLFCVDIDECSIMNGGCETF 1211 

Qy 156 CANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENP 191 

I I III hi III I I III h:|| 
Db 1212 CTNS EGS YECSCQPGFALMPDQRSCTD I DEC - EDNPN I CDGGQCTN I PGE YRCLCYDGFM 1270 

Qy 192 CVQ TCVNTYGS F I CRCDPGYELEEDGVHCSDMDECSFS E 230 

II II II Mill II II h 'I 

Db 1271 ASEDMKTCVDVNECDLNPN I CLSGTCENTKGSF I CHCDMGYSGKKGKTGCTDI NECE I GA 1330 

Qy 231 FLCQHE- -CVNQPGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFK 288 

I I I |:'= III I hi I | |::|| : | |: | | | :: 

Db 1331 HNCDRHAVCTNTAGSFKCSCSPGWI - -GDGI KCTDLDECSNGTHMCSQHADCKNTMGSYR 1388 

Qy 289 CIDPIRCEEPYL RISDNRCMCPAENPGCRDQP 320 

1= hi I ' hi :| | | : | 

Db 1389 CL CKEG YTGDGFTCTDLDECSENLNLC - - GNGQCLNAP 1424 



Search completed: January 9, 2004, 12:34:54 
Job time : 17.0264 secs



GenCore version 5.1.6 
OM protein - protein search, using sw model
January 9, 2004, 12:21:36 ; Search time 9.71297 Seconds
(without alignments)
2048.013 Million cell updates/sec

Title: US-09-674-379A-14
Perfect score: 2407
Sequence: 1 QCTNGFDLDRQSGQCLDIDE INFRGSSVIRLRIYVSQYPF 423

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 127863 seqs, 47026705 residues

Total number of hits satisfying chosen parameters: 127863

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database: SwissProt 41:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
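
The percentages printed in each RESULT summary can be cross-checked against the other numbers in the same summary. The sketch below is only an illustration: it assumes that "Query Match" is the score divided by the perfect score from the run header, and that "Best Local Similarity" is the number of matches divided by the sum of matches, conservative substitutions, mismatches and indels. Both relationships are inferred from the figures in this listing, not taken from GenCore documentation, and the names `SUMMARY_RE` and `recompute_summary` are hypothetical.

```python
import re

# Pattern for the three summary lines printed before each alignment.
SUMMARY_RE = re.compile(
    r"Query Match\s+(?P<query_match>[\d.]+)%;\s*Score\s+(?P<score>[\d.]+);"
    r".*?Best Local Similarity\s+(?P<similarity>[\d.]+)%;"
    r".*?Matches\s+(?P<matches>\d+);\s*Conservative\s+(?P<conservative>\d+);"
    r"\s*Mismatches\s+(?P<mismatches>\d+);\s*Indels\s+(?P<indels>\d+);"
    r"\s*Gaps\s+(?P<gaps>\d+);",
    re.DOTALL,
)


def recompute_summary(block, perfect_score):
    """Parse one summary block and recompute its two percentages.

    Assumption (inferred from this listing, not from vendor docs):
      query match = 100 * score / perfect score
      similarity  = 100 * matches / (matches + conservative + mismatches + indels)
    """
    m = SUMMARY_RE.search(block)
    if m is None:
        raise ValueError("no alignment summary found")
    score = float(m.group("score"))
    counts = {
        key: int(m.group(key))
        for key in ("matches", "conservative", "mismatches", "indels")
    }
    columns = sum(counts.values())  # assumed number of alignment columns
    return {
        "printed_query_match": float(m.group("query_match")),
        "printed_similarity": float(m.group("similarity")),
        "query_match": 100.0 * score / perfect_score,
        "similarity": 100.0 * counts["matches"] / columns,
    }


# Summary of the hit T22793, copied from this listing.
sample = """
Query Match 23.8%; Score 574; DB 2; Length 798;
Best Local Similarity 26.8%; Pred. No. 1.6e-30;
Matches 159; Conservative 49; Mismatches 172; Indels 214; Gaps 22;
"""
result = recompute_summary(sample, perfect_score=2407)
```

Comparing `result["query_match"]` with `result["printed_query_match"]` (and likewise for the similarity) to one decimal place is a quick way to spot digits damaged in transcription.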
ALIGNMENTS



RESULT 1 
FBL5_HUMAN 

ID FBL5_HUMAN STANDARD; PRT; 448 AA.
AC Q9UBX5; O75966;
DT 16-OCT-2001 (Rel. 40, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest
DE EGF-like protein) (Dance) (Urine p50 protein) (UP50).
GN FBLN5 OR DANCE.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Melanoma;
RA Kostka G.;
RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RX MEDLINE=99357779; PubMed=10428823;
RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,
RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,
RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;
RT "DANCE, a novel secreted RGD protein expressed in developing,
RT atherosclerotic, and balloon-injured arteries.";
RL J. Biol. Chem. 274:22476-22483(1999).
RN [3]
RP SEQUENCE FROM N.A.
RC TISSUE=Urine;
RA Zemel R., Sholto O., Shaul Y.;
RL Submitted (SEP-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH 

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR 

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR 

CC DEVELOPMENT AND REMODELING. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN HEART, OVARY, AND 
CC COLON BUT ALSO IN KIDNEY, PANCREAS, TESTIS, LUNG, AND PLACENTA. 

CC NOT DETECTABLE IN BRAIN, LIVER, THYMUS, PROSTATE, OR PERIPHERAL 

CC BLOOD LEUKOCYTES. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC --- 

DR EMBL; AJ133490; CAB38568.1; -.
DR EMBL; AF112152; AAD41768.1; -.
DR EMBL; AF093118; AAC62107.1; -.
DR HSSP; P00736; 1APQ.
DR Genew; HGNC:3602; FBLN5.
DR MIM; 604580; -.
DR GO; GO:0005578; C:extracellular matrix; TAS.
DR GO; GO:0005625; C:soluble fraction; TAS.
DR GO; GO:0005178; F:integrin binding activity; TAS.
DR GO; GO:0007160; P:cell-matrix adhesion; TAS.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF00008; EGF; 4.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.

KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;
KW Glycoprotein.
FT   SIGNAL        1     23       POTENTIAL.
FT   CHAIN        24    448       FIBULIN-5.
FT   DOMAIN       24     69       EGF-LIKE 1, DIVERGENT.
FT   DOMAIN      127    167       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      168    206       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      207    246       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      247    287       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      288    333       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT   SITE         54     56       CELL ATTACHMENT SITE (POTENTIAL).
FT   DISULFID    131    144       BY SIMILARITY.
FT   DISULFID    138    153       BY SIMILARITY.
FT   DISULFID    155    166       BY SIMILARITY.
FT   DISULFID    172    181       BY SIMILARITY.
FT   DISULFID    177    190       BY SIMILARITY.
FT   DISULFID    192    205       BY SIMILARITY.
FT   DISULFID    211    221       BY SIMILARITY.
FT   DISULFID    217    230       BY SIMILARITY.
FT   DISULFID    232    245       BY SIMILARITY.
FT   DISULFID    251    262       BY SIMILARITY.
FT   DISULFID    258    271       BY SIMILARITY.
FT   DISULFID    273    286       BY SIMILARITY.
FT   DISULFID    292    305       BY SIMILARITY.
FT   DISULFID    299    314       BY SIMILARITY.
FT   DISULFID    320    332       BY SIMILARITY.
FT   CARBOHYD    283    283       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    296    296       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CONFLICT     69     70       IP -> HS (IN REF. 3).
FT   CONFLICT    147    148       TE -> MK (IN REF. 3).
SQ   SEQUENCE   448 AA;  50180 MW;  19FCA51FDA328003 CRC64;



Query Match 100.0%; Score 2407; DB 1; Length 448;
Best Local Similarity 100.0%; Pred. No. 1.1e-171;
Matches 423; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy         61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         86 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 145

Qy        121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 205

Qy        181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        206 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 265

Qy        241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        266 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 325

Qy        301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        326 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy        361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        386 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy        421 YPF 423
              |||
Db        446 YPF 448


RESULT 2 
FBL5_RAT 

ID FBL5_RAT STANDARD; PRT; 448 AA.
AC Q9WVH8; Q9R284;
DT 16-OCT-2001 (Rel. 40, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest
DE EGF-like protein) (Dance) (Embryonic vascular EGF repeat-containing
DE protein) (EVEC).
GN FBLN5 OR DANCE.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=99357779; PubMed=10428823;
RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,
RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,
RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;
RT "DANCE, a novel secreted RGD protein expressed in developing,
RT atherosclerotic, and balloon-injured arteries.";
RL J. Biol. Chem. 274:22476-22483(1999).
RN [2]
RP SEQUENCE FROM N.A.
RX MEDLINE=99278197; PubMed=10347091;
RA Kowal R.C., Richardson J.A., Miano J.M., Olson E.N.;
RT "EVEC, a novel epidermal growth factor-like repeat-containing protein
RT upregulated in embryonic and diseased adult vasculature.";
RL Circ. Res. 84:1166-1176(1999).

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH 

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR 

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR 

CC DEVELOPMENT AND REMODELING. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF112153; AAD41769.1; -.
DR EMBL; AF137350; AAD25101.1; -.
DR HSSP; P00736; 1APQ.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF00008; EGF; 4.



DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;
KW Glycoprotein.
FT   SIGNAL        1     23       POTENTIAL.
FT   CHAIN        24    448       FIBULIN-5.
FT   DOMAIN       24     69       EGF-LIKE 1, DIVERGENT.
FT   DOMAIN      127    167       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      168    206       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      207    246       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      247    287       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      288    333       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT   SITE         54     56       CELL ATTACHMENT SITE (POTENTIAL).
FT   DISULFID    131    144       BY SIMILARITY.
FT   DISULFID    138    153       BY SIMILARITY.
FT   DISULFID    155    166       BY SIMILARITY.
FT   DISULFID    172    181       BY SIMILARITY.
FT   DISULFID    177    190       BY SIMILARITY.
FT   DISULFID    192    205       BY SIMILARITY.
FT   DISULFID    211    221       BY SIMILARITY.
FT   DISULFID    217    230       BY SIMILARITY.
FT   DISULFID    232    245       BY SIMILARITY.
FT   DISULFID    251    262       BY SIMILARITY.
FT   DISULFID    258    271       BY SIMILARITY.
FT   DISULFID    273    286       BY SIMILARITY.
FT   DISULFID    292    305       BY SIMILARITY.
FT   DISULFID    299    314       BY SIMILARITY.
FT   DISULFID    320    332       BY SIMILARITY.
FT   CARBOHYD    283    283       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    296    296       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CONFLICT    238    238       L -> P (IN REF. 2).
SQ   SEQUENCE   448 AA;  50160 MW;  E6BC68F7BF14B714 CRC64;



Query Match 95.7%; Score 2303; DB 1; Length 448;
Best Local Similarity 94.8%; Pred. No. 5.5e-164;
Matches 401; Conservative 10; Mismatches 12; Indels 0; Gaps 0;

Qy          1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
              |||||||||||:||||||||||||||||||||||||||||||||||||||||||||||||
Db         26 QCTNGFDLDRQTGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy         61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
              | |||||||||||: | |||||||||:||||||||| |||||||||||||||||||||||
Db         86 TSYSGPYPAAAPPVPASNYPTISRPLVCRFGYQMDEGNQCVDVDECATDSHQCNPTQICI 145

Qy        121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
Db        146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNDDGRSC 205

Qy        181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
              |||||| ||||||||||||||||||||||||||||||:||||||||||||||||||||||
Db        206 QDVNECETENPCVQTCVNTYGSFICRCDPGYELEEDGIHCSDMDECSFSEFLCQHECVNQ 265

Qy        241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
              ||:|||||||||:||:||||||||||||||||||   |||||||||||||||| ||||||
Db        266 PGSYFCSCPPGYVLLEDNRSCQDINECEHRNHTCTPLQTCYNLQGGFKCIDPIVCEEPYL 325

Qy        301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
               | |||||||||| ||||||||||:|||||||||||||||||||||||||||||||||||
Db        326 LIGDNRCMCPAENTGCRDQPFTILFRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy        361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
              |||||||||||||||||||||||||||||:||||||||||||||||||||||||||||||
Db        386 GNEGREFYMRQTGPISATLVMTRPIKGPRDIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 445

Qy        421 YPF 423
              |||
Db        446 YPF 448



RESULT 3 
FBL5_MOUSE
ID FBL5_MOUSE STANDARD; PRT; 448 AA.
AC Q9WVH9;
DT 16-OCT-2001 (Rel. 40, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-5 precursor (FIBL-5) (Developmental arteries and neural crest
DE EGF-like protein) (Dance).
GN FBLN5 OR DANCE.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=99357779; PubMed=10428823;
RA Nakamura T., Ruiz-Lozano P., Lindner V., Yabe D., Taniwaki M.,
RA Furukawa Y., Kobuke K., Tashiro K., Lu Z., Andon N.L., Schaub R.,
RA Matsumori A., Sasayama S., Chien K.R., Honjo T.;
RT "DANCE, a novel secreted RGD protein expressed in developing,
RT atherosclerotic, and balloon-injured arteries.";
RL J. Biol. Chem. 274:22476-22483(1999).

CC -!- FUNCTION: PROMOTES ADHESION OF ENDOTHELIAL CELLS THROUGH 

CC INTERACTION OF INTEGRINS AND THE RGD MOTIF. COULD BE A VASCULAR 

CC LIGAND FOR INTEGRIN RECEPTORS AND MAY PLAY A ROLE IN VASCULAR 

CC DEVELOPMENT AND REMODELING. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF112151; AAD41767.1; -.
DR HSSP; P00736; 1APQ.



DR MGD; MGI:1346091; Fbln5.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF00008; EGF; 4.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW Cell adhesion; Calcium-binding; Repeat; Signal; EGF-like domain;
KW Glycoprotein.
FT   SIGNAL        1     23       POTENTIAL.
FT   CHAIN        24    448       FIBULIN-5.
FT   DOMAIN       24     69       EGF-LIKE 1, DIVERGENT.
FT   DOMAIN      127    167       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      168    206       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      207    246       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      247    287       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      288    333       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT   SITE         54     56       CELL ATTACHMENT SITE (POTENTIAL).
FT   DISULFID    131    144       BY SIMILARITY.
FT   DISULFID    138    153       BY SIMILARITY.
FT   DISULFID    155    166       BY SIMILARITY.
FT   DISULFID    172    181       BY SIMILARITY.
FT   DISULFID    177    190       BY SIMILARITY.
FT   DISULFID    192    205       BY SIMILARITY.
FT   DISULFID    211    221       BY SIMILARITY.
FT   DISULFID    217    230       BY SIMILARITY.
FT   DISULFID    232    245       BY SIMILARITY.
FT   DISULFID    251    262       BY SIMILARITY.
FT   DISULFID    258    271       BY SIMILARITY.
FT   DISULFID    273    286       BY SIMILARITY.
FT   DISULFID    292    305       BY SIMILARITY.
FT   DISULFID    299    314       BY SIMILARITY.
FT   DISULFID    320    332       BY SIMILARITY.
FT   CARBOHYD    283    283       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    296    296       N-LINKED (GLCNAC...) (POTENTIAL).
SQ   SEQUENCE   448 AA;  50193 MW;  F15CC70CCFBFDC97 CRC64;



Query Match 95.6%; Score 2302; DB 1; Length 448;
Best Local Similarity 94.8%; Pred. No. 6.6e-164;
Matches 401; Conservative 9; Mismatches 13; Indels 0; Gaps 0;

Qy          1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         26 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 85

Qy         61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120
              | |||||||||||: | |||||||||:||||||||| |||||||||||||||||||||||
Db         86 TSYSGPYPAAAPPVPASNYPTISRPLVCRFGYQMDEGNQCVDVDECATDSHQCNPTQICI 145

Qy        121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
Db        146 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNDDGRSC 205

Qy        181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240
              |||||| ||||||||||||||||||||||||||||||:||||||||||||||||||||||
Db        206 QDVNECETENPCVQTCVNTYGSFICRCDPGYELEEDGIHCSDMDECSFSEFLCQHECVNQ 265

Qy        241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300
              ||:|||||||||:|||||||||||||||||||||   |||||||||||||||| ||||||
Db        266 PGSYFCSCPPGYVLLDDNRSCQDINECEHRNHTCTSLQTCYNLQGGFKCIDPISCEEPYL 325

Qy        301 RISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 360
               | :||||||||:  |||||||||||||||||||||||||||||||||||||||||||||
Db        326 LIGENRCMCPAEHTSCRDQPFTILYRDMDVVSGRSVPADIFQMQATTRYPGAYYIFQIKS 385

Qy        361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420
              IIIIIIIMIIIIIMIIMIIIMIIIhlllMIIIMIIMIIIMIIIIIIIIIII
Db        386

Qy        421 YPF 423
              |||
Db        446 YPF 448



RESULT 4 
FBL4_CRIGR 

ID FBL4_CRIGR STANDARD; PRT; 443 AA. 

AC O55058;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor 

DE (Fibulin-4) (FIBL-4) (H411 protein).

GN EFEMP2 OR FBLN4.

OS Cricetulus griseus (Chinese hamster).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae;

OC Cricetulus. 

OX NCBI_TaxID=10029; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary;

RA Heine H., Delude R.L., Monks B., Golenbock D.T.;

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF046870; AAC03101.1; -. 

DR HSSP; P00736; 1APQ. 

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.



DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4.

DR PRINTS; PR00907; THRMBOMODULN.

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal.

FT SIGNAL 1 25 POTENTIAL.

FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 2.

FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 243 282 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 283 328 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).

FT DISULFID 127 140 BY SIMILARITY.

FT DISULFID 134 149 BY SIMILARITY.

FT DISULFID 151 162 BY SIMILARITY.

FT DISULFID 168 177 BY SIMILARITY.

FT DISULFID 173 186 BY SIMILARITY.

FT DISULFID 188 201 BY SIMILARITY.

FT DISULFID 207 217 BY SIMILARITY.

FT DISULFID 213 226 BY SIMILARITY.

FT DISULFID 228 241 BY SIMILARITY.

FT DISULFID 247 258 BY SIMILARITY.

FT DISULFID 254 267 BY SIMILARITY.

FT DISULFID 269 281 BY SIMILARITY.

FT DISULFID 287 300 BY SIMILARITY.

FT DISULFID 294 309 BY SIMILARITY.

FT DISULFID 315 327 BY SIMILARITY.

FT CARBOHYD 198 198 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 394 394 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 443 AA; 49432 MW; 0BCFE5D7323D9E5F CRC64;



Query Match 53.0%; Score 1276.5; DB 1; Length 443; 

Best Local Similarity 52.2%; Pred. No. 8.4e-88; 

Matches 221; Conservative 64; Mismatches 121; Indels 17; Gaps 4;



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

' : I I I h:|| hi Ml I 'I I

Db 38 ECTDGYEWDADSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPRSAAVTNDLHG 93

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120

M I II II I Ih II llllllll I I hi I

Db 94 EGP-PPPVPPAQHPN PCPPGYEPDEQESCVDVDECAQALHDCRPSQDCH 141

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180

I M hi Ml = :hlllllll III I hllh I Mill : III

Db 142 NLPGSYQCTCPDGYRKVGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240

Hill M I I hlhhllh I I I I II llhlllhl II |||:

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCNQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261



Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

II = I II II II I Mlhll I h III I lh:|:| II I h 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECETGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYV 320 

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360

-MM || 11 = 11 :|::| I : I I I I ! ||||| I

Db 321 QVSDNRCFCPVSNPLCREQPSSIVHRYMSITSERSVPADVFQIQATSVYPGAYNAFQIRA 380

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420

II :|hll :|| 11= lh I I I I I I I I I = I = I = = : : : I Mhll -I
Db 381 GNTQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440

Qy 421 YPF 423
       | |
Db 441 YTF 443



RESULT 5 
FBL4_HUMAN 

ID FBL4_HUMAN STANDARD; PRT; 443 AA. 

AC O95967; O75967;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor

DE (Fibulin-4) (FIBL-4) (UPH1 protein).

GN EFEMP2 OR FBLN4.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Melanoma; 

RX MEDLINE=20068041; PubMed=10601734;

RA Giltay R., Timpl R., Kostka G.;

RT "Sequence, recombinant expression and tissue localization of two novel 

RT extracellular matrix proteins, fibulin-3 and fibulin-4."; 

RL Matrix Biol. 18:469-480(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Zemel R., Shaul Y.;

RL Submitted (SEP-1998) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20435063; PubMed=10982184;

RA Katsanis N., Venable S., Smith J.R., Lupski J.R.;

RT "Isolation of a paralog of the Doyne honeycomb retinal dystrophy gene

RT from the multiple retinopathy critical region on 11q13.";

RL Hum. Genet. 106:66-72(2000). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AJ132819; CAA10791.2; -.

DR EMBL; AF093119; AAC62108.1; -.

DR EMBL; AF109121; AAF65188.1; -. 

DR EMBL; BC010456; AAH10456.1; -. 

DR HSSP; P35555; 1EMN. 

DR Genew; HGNC:3219; EFEMP2.

DR MIM; 604633; -. 

DR GO; GO:0005604; C:basement membrane; TAS. 

DR GO; GO:0005201; F:extracellular matrix structural constituent; TAS.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4. 

DR PRINTS; PR00907; THRMBOMODULN. 

DR SMART; SM00179; EGF_CA; 4. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6. 

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal. 

FT SIGNAL 1 25 POTENTIAL.

FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 2.

FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
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CONFLICT 
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A ■ 


-> T (IN REF. 1) . 


FT CONFLICT 44 51 EWDPDSQH -> TQTAN (IN REF. 2).

FT CONFLICT 103 111 AQHPNPCPP -> VNTQPLPT (IN REF. 2).

FT CONFLICT 294 294 C -> W (IN REF. 2).

FT CONFLICT 354 356 RSV -> AER (IN REF. 2).

FT CONFLICT 355 355 S -> R (IN REF. 3).

SQ SEQUENCE 443 AA;
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MW; 9E9AC2393780D3B8 CRC64; 



Query Match 53.0%; Score 1276.5; DB 1; Length 443; 

Best Local Similarity 52.0%; Pred. No. 8.4e-88; 

Matches 220; Conservative 67; Mismatches 119; Indels 17; Gaps 4; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

= 11 = 1 = = I I I i , |:| ,,,,!■■■■ | 
Db 38 ECTDGYEWDPDSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPRSAAVINDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120

M I M II I Ih h : llllllll I I 1 = 1 I 

Db 94 ---EGP-PPPVPPAQHPN PCPPGYEPDDQDSCVDVDECAQALHDCRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180

I I I hi III = : hi II 111 I III I hi Ih I I Ml | : Ml 
Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240

Mill II I I hlhhlll I I I I II Mhlllhl :||h |||: 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261 

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

M = I M M II I lllhlll I h Ml I lh:hl II llh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYI 320 

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360

= = 1 = 111 = 111 II Ihll I | : | MMIhlhllh Mill llh: 

Db 321 QVSENRCLCPASNPLCREQPSSIVHRYMTITSERSVPADVFQIQATSVYPGAYNAFQIRA 380

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420 

M =11 = 11 Ml 11= 11= MM IIMhhh = = = =| I I I : I I =M 



Db 381 GNSQGDFYIRQINNVSAMLVIARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440

Qy 421 YPF 423
       | |
Db 441 YTF 443



RESULT 6 
FBL4_MOUSE 

ID FBL4_MOUSE STANDARD; PRT; 443 AA.

AC Q9WVJ9;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 2 precursor

DE (Fibulin-4) (FIBL-4) (Mutant p53 binding protein 1).

GN EFEMP2 OR FBLN4 OR MBP1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=99308589; PubMed=10380882;

RA Gallagher W.M., Argentini M., Sierra V., Bracco L., Debussche L.,

RA Conseiller E.;

RT "MBP1: a novel mutant p53-specific protein partner with oncogenic

RT properties."; 

RL Oncogene 18:3608-3616(1999). 

CC -!- SUBUNIT: BINDS PREFERENTIALLY TO P53 MUTANTS. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF104223; AAD45219.1; -. 

DR HSSP; P00736; 1APQ.

DR MGD; MGI:1891209; Efemp2.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR001491; Thrmbomoduln.

DR Pfam; PF00008; EGF; 4. 

DR PRINTS; PR00907; THRMBOMODULN. 

DR SMART; SM00179; EGF_CA; 4. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6. 



KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal.

FT SIGNAL 1 25 POTENTIAL.

FT CHAIN 26 443 EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

FT MATRIX PROTEIN 2.

FT DOMAIN 36 81 EGF-LIKE 1, DIVERGENT.

FT DOMAIN 123 163 EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 164 202 EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 203 242 EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 243 282 EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).

FT DOMAIN 283 328 EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).

FT DISULFID 127 140 BY SIMILARITY.

FT DISULFID 134 149 BY SIMILARITY.

FT DISULFID 151 162 BY SIMILARITY.

FT DISULFID 168 177 BY SIMILARITY.

FT DISULFID 173 186 BY SIMILARITY.

FT DISULFID 188 201 BY SIMILARITY.

FT DISULFID 207 217 BY SIMILARITY.

FT DISULFID 213 226 BY SIMILARITY.

FT DISULFID 228 241 BY SIMILARITY.

FT DISULFID 247 258 BY SIMILARITY.

FT DISULFID 254 267 BY SIMILARITY.

FT DISULFID 269 281 BY SIMILARITY.

FT DISULFID 287 300 BY SIMILARITY.

FT DISULFID 294 309 BY SIMILARITY.

FT DISULFID 315 327 BY SIMILARITY.

FT CARBOHYD 198 198 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 394 394 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 443 AA; 49425 MW; 4969C0328A23DD88 CRC64;



Query Match 52.9%; Score 1272.5; DB 1; Length 443; 

Best Local Similarity 52.0%; Pred. No. 1.7e-87; 

Matches 220; Conservative 65; Mismatches 121; Indels 17; Gaps 3; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

IN- | | | I 1)1111 = 1 = 1 hi llllhlh I 

Db 38 ECTDGYEWDADSQHCRDVNECLTI PEACKGEMKCINHYGGYLCLPRSAAVI SDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICI 120

I I Ml I 1 II lllllll I I hi I 

Db 94 ---EGPPPPAA HAQQPNPCPQGYEPDEQESCVDVDECTQALHDCRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi III : MMIIIIII III I hllh I I III I : III 

Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240 

Mill II I I hlhhllh I I I I II llhlll ! :||h llh 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCNQGYELHRDGFSCSDIDECGYSSYLCQYRCVNE 261 

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

II : I II II II I lllhlll | h Ml | ||::|:| || |||: 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECETGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYV 320

Qy 301 RISDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKS 360

- Ml I II hi ! I : I lllllhlhllh Mill llhl 

Db 321 QVSDNRCLCPASNPLCREQPSSIVHRYMSITSERSVPADVFQIQATSVYPGAYNAFQIRS 380



Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420

II HMI HI Ih Ih II I I llllhhh::::! llhll 
Db 381 GNTQGDFYIRQINNVSAMLVIARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440 

Qy 421 YPF 423 

I I 

Db 441 YTF 443 



RESULT 7 
FBL3_RAT 

ID FBL3_RAT STANDARD; PRT; 493 AA.

AC O35568;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 1 precursor 

DE (Fibulin-3) (FIBL-3) (T16 protein).

GN EFEMP1 OR FBLN3.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Lung;

RX MEDLINE=97415782; PubMed=9268694;

RA Ozaki T., Kondo K., Nakamura Y., Ichimiya S., Nakagawara A.,

RA Sakiyama S.;

RT "Interaction of DA41, a DAN-binding protein, with the epidermal growth

RT factor-like protein, S(1-5).";

RL Biochem. Biophys. Res. Commun. 237:245-250(1997). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D89730; BAA22265.1; -. 

DR PIR; JC5621; JC5621. 

DR HSSP; P35555; 1EMN. 

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 

DR Pfam; PF00008; EGF; 3. 

DR SMART; SM00179; EGF_CA; 4.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6.

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal.
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FT 


DISULFID 
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FT 


DISULFID 


279 
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FT 


DISULFID 


298 


309 


FT 


DISULFID 


305 
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FT 


DISULFID 


320 


332 


FT 


DISULFID 


338 


350 


FT 


DISULFID 


344 
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FT 


DISULFID 


365 
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SQ 


SEQUENCE 


493 AA; 


54 



POTENTIAL.

EGF-CONTAINING FIBULIN-LIKE EXTRACELLULAR

MATRIX PROTEIN 1.

EGF-LIKE 1, DIVERGENT.

CALCIUM-BINDING 
CALCIUM-BINDING 
CALCIUM-BINDING 
CALCIUM-BINDING 
CALCIUM-BINDING 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



EGF-LIKE 2,
EGF-LIKE 3,
EGF-LIKE 4,
EGF-LIKE 5,
EGF-LIKE 6,
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
MW; 22DAFD70BACF1CA5 CRC64 ; 

Query Match 46.1%; Score 1108.5; DB 1; Length 493; 

Best Local Similarity 43.6%; Pred. No. 2.7e-75; 

Matches 204; Conservative 68; Mismatches 149; Indels 47; Gaps 5;

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

' I II Mill -!M I III "I . I I :

Db 28 QCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGYLCLPKTAQIIVNNEQPQQE 87

Qy 61 TPYS GPYPAAAPPLSAPNYPT 81

M : I = hi - I I

Db 88 TPAAEASSGAATGTIAARSMATSGVIPGGGFIASATAVAGPEVQTGRNNFVIRRNPADPQ 147

Qy 82 ISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWL 136

I : I I I : III hill : :| I hill I ' | ||

Db 148 RIPSNPSHRIQCAAGYEQSEHNVCQDIDECTSGTHNCRLDQVCINLRGSFTCHCLPGYQK 207



Qy 137 LEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQT 195

IhMIII II I I I llh I Mill I : M hill I I I 

Db 208 RGEQCVDIDECSVPPYCHQGCWTPGSFYCQCNPGFQIAANNYTCVDINECDASNQCAQQ 267 

Qy 196 CVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILL 255 

I I M - I I I I I :H hill I :|lh:|lhll = I II II - 

Db 268 CYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYQW 327

Qy 256 DDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPG 315

'MINIMI || : |:| |||:| |::||: |:|||:|| | 

Db 328 -RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPQNPCQDPYVLTSENRCVCPVSNTM 385

Qy 316 CRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTGPI 375

Ml I =hh M I lllhlllhllll I hllllll MhlM h 

Db 386 CRDVPQSIVYKYMNIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYLRQTSPV 445



Qy 376 SATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423 

M Ih : : I I I I : 1^ || MhM I | = | 

Db 446 SAMLVLVKSLTGPREHIVGLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 493



RESULT 8 
FBL3_HUMAN 

ID FBL3_HUMAN STANDARD; PRT; 493 AA.

AC Q12805; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE EGF-containing fibulin-like extracellular matrix protein 1 precursor 

DE (Fibulin-3) (FIBL-3) (Fibrillin-like protein) (Extracellular protein

DE S1-5).

GN EFEMP1 OR FBLN3 OR FBNL.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND POSSIBLE ALTERNATIVE SPLICING. 

RC TISSUE=Skin; 

RX MEDLINE=95097983; PubMed=7799918;

RA Lecka-Czernik B., Lumpkin C.K. Jr., Goldstein S.; 

RT "An overexpressed gene transcript in senescent and quiescent human 

RT fibroblasts encoding a novel protein in the epidermal growth factor- 

RT like repeat family stimulates DNA synthesis."; 

RL Mol. Cell. Biol. 15:120-128(1995). 

RN [2] 

RP SEQUENCE FROM N.A.

RX MEDLINE=97001163; PubMed=8812496;

RA Ikegawa S., Toda T., Okui K., Nakamura Y.;

RT "Structure and chromosomal assignment of the human S1-5 gene (FBNL)

RT that is highly homologous to fibrillin.";

RL Genomics 35:590-592(1996).

RN [3]

RP SEQUENCE FROM N.A.

RX MEDLINE=20068041; PubMed=10601734;

RA Giltay R., Timpl R., Kostka G.;

RT "Sequence, recombinant expression and tissue localization of two novel 

RT extracellular matrix proteins, fibulin-3 and fibulin-4."; 

RL Matrix Biol. 18:469-480(1999). 

RN [4] 

RP VARIANT DHRD/MLVT TRP-345, AND VARIANT PHE-220. 

RX MEDLINE=99295941; PubMed=10369267;

RA Stone E.M., Lotery A.J., Munier F.L., Heon E., Piguet B., Guymer R.H.,

RA Vandenburgh K., Cousin P., Nishimura D., Swiderski R.E., Silvestri G.,

RA Mackey D.A., Hagerman G.S., Bird A.C., Sheffield V.C., 

RA Schorderet D.F.; 

RT "A single EFEMP1 mutation associated with both malattia Leventinese 

RT and Doyne honeycomb retinal dystrophy."; 

RL Nat. Genet. 22:199-202(1999). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4;

CC Comment=Experimental confirmation may be lacking for some

CC isoforms;

CC Name=1;

CC IsoId=Q12805-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q12805-2; Sequence=VSP_001392;

CC Name=3;

CC IsoId=Q12805-3; Sequence=VSP_001393;

CC Name=4;

CC IsoId=Q12805-4; Sequence=VSP_001394;

CC -!- DISEASE: DEFECTS IN EFEMP1 ARE A CAUSE OF DOYNE HONEYCOMB RETINAL

CC DYSTROPHY (DHRD) ALSO KNOWN AS MALATTIA LEVENTINESE (MLVT OR ML),

CC AN AUTOSOMAL DOMINANT DISEASE CHARACTERIZED BY YELLOW-WHITE

CC DEPOSITS KNOWN AS DRUSEN THAT ACCUMULATE BENEATH THE RETINAL 

CC PIGMENT EPITHELIUM. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 6 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; U03877; AAA65590.1; -.

DR HSSP; P35555; 1EMN.

DR Genew; HGNC:3218; EFEMP1.

DR MIM; 601548; -.

DR MIM; 126600; -.

DR GO; GO:0005578; C:extracellular matrix; TAS.

DR GO; GO:0007601; P:vision; TAS.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 

DR Pfam; PF00008; EGF; 3. 

DR SMART; SM00179; EGF_CA; 4. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS00022; EGF_1; FALSE_NEG.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 6. 

KW Repeat; EGF-like domain; Calcium-binding; Glycoprotein; Signal; 

KW Disease mutation; Polymorphism; Alternative splicing. 
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SEQUENCE 


493 AA; 54640 MW; 128CA5ED140DF414 CRC64;



Query Match 45.4%; Score 1093.5; DB 1; Length 493; 

Best Local Similarity 43.2%; Pred. No. 3.6e-74; 

Matches 203; Conservative 64; Mismatches 152; Indels 51; Gaps 7; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVY RGPYSN 57

' I'- | || HIM :|:||:| | ||| |||||:|:| : | 

Db 28 QCTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGYLCLPKTAQIIVNNEQPQQE 87

Qy 58 PY STPYSGPYPAA APPL 74 

I M I II 
Db 88 TQPAEGTSGATTGWAASSMATSGVLPGGGFVASAAAVAGPEMQTGRNNFVIRRNPADPQ 147 

Qy 75 SAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGY 134

h I I : I 11= I I I hill H I hill I || 

Db 148 RI PSNP- -SHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQCPPGY 205 

Qy 135 WLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCV 193

Ihlllll II I I I Mh I hill I : ' hill I I

Db 206 QKRGEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDASNQCA 265

Qy 194 QTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYI 253 

I I I IMIhh I II I I ::| hill | :||h:||hll = I II || 
Db 266 QQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYQ 325

Qy 254 LLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAEN 313

- 'MINIMI I I : hi | Ih | hHh Mlhll I 

Db 326 W-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVCPVSN 383 

Qy 314 PGCRDQPFTILYRDMDWSGRSVPADIFQMQATTRYPGAYYIFQIKSGNEGREFYMRQTG 373

I h I : h h I : I 1 1 1 h 1 1 I h 1 1 1 1 I h I 1 1 I II II hill

Db 384 AMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYLRQTS 443



Qy 374 PISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423

Db 444 PVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 493



RESULT 9 
FBL2_MOUSE

ID FBL2_MOUSE STANDARD; PRT; 1221 AA.

AC P37889; Q9WUI2; 

DT 01-OCT-1994 (Rel. 30, Created) 

DT 01-OCT-1994 (Rel. 30, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibulin-2 precursor. 

GN FBLN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 27-35. 

RC TISSUE=Fibroblast;

RX MEDLINE=94064787; PubMed=8245130;

RA Pan T.-C., Sasaki T., Zhang R.-Z., Faessler R., Timpl R., Chu M.-L.;

RT "Structure and expression of fibulin-2, a novel extracellular matrix

RT protein with multiple EGF-like repeats and consensus motifs for

RT calcium binding.";

RL J. Cell Biol. 123:1269-1277(1993). 

RN [2] 

RP SEQUENCE FROM N.A., AND ALTERNATIVE SPLICING. 

RX MEDLINE=99337686; PubMed=10406956;

RA Graessel S., Sicot F.-X., Gotta S., Chu M.-L.;

RT "Mouse fibulin-2 gene. Complete exon-intron organization and promoter

RT characterization.";

RL Eur. J. Biochem. 263:471-477(1999). 

RN [3] 

RP DEVELOPMENTAL STAGE. 

RX PubMed=8850569; 

RA Zhang H.-Y., Timpl R., Sasaki T., Chu M.-L., Ekblom P.;

RT "Fibulin-1 and fibulin-2 expression during organogenesis in the

RT developing mouse embryo."; 

RL Dev. Dyn. 205:348-364(1996). 

RN [4] 

RP BINDING TO LAMA2.

RX PubMed=10022829;

RA Talts J.F., Andac Z., Goehring W., Brancaccio A., Timpl R.;

RT "Binding of the G domains of laminin alpha1 and alpha2 chains and

RT perlecan to heparin, sulfatides, alpha-dystroglycan and several

RT extracellular matrix proteins.";

RL EMBO J. 18:863-870(1999).

RN [5]

RP DOWN-REGULATION BY GLUCOCORTICOIDS.

RX PubMed=11737251;

RA Gu Y.-C., Talts J.F., Gullberg D., Timpl R., Ekblom M.;

RT "Glucocorticoids down-regulate the extracellular matrix proteins

RT fibronectin, fibulin-1 and fibulin-2 in bone marrow stroma.";

RL Eur. J. Haematol. 67:176-184(2001). 

CC -!- FUNCTION: ITS BINDING TO FIBRONECTIN AND SOME OTHER LIGANDS IS 
CC CALCIUM DEPENDENT. 
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SUBUNIT: Homotrimer ; disulf ide-linked . Interacts with LAMA2 . 
SUBCELLULAR LOCATION: Secreted; extracellular matrix.
ALTERNATIVE PRODUCTS:

Event=Alternative splicing; Named isoforms=2;

Comment=Additional isoforms seem to exist;
Name=1;

IsoId=P37889-1; Sequence=Displayed;
Name=2; Synonyms=EGF3-less;

IsoId=P37889-2; Sequence=VSP_001391;
TISSUE SPECIFICITY: Component of both basement membranes and other
connective tissues.

DEVELOPMENTAL STAGE: The differential expression of the fibulin
family contributes to the formation of molecularly distinct
extracellular matrices already during early developmental stages
of a large number of tissues.

INDUCTION: Glucocorticoids suppressed mRNA expression and protein
synthesis.

SIMILARITY: Belongs to the fibulin family.
SIMILARITY: Contains 3 anaphylatoxin-like domains.
SIMILARITY: Contains 11 EGF-like domains.

This SWISS-PROT entry is copyright. It is produced through a collaboration
between the Swiss Institute of Bioinformatics and the EMBL outstation -
the European Bioinformatics Institute. There are no restrictions on its
use by non-profit institutions as long as its content is in no way
modified and this statement is not removed. Usage by and for commercial
entities requires a license agreement (See http://www.isb-sib.ch/announce/
or send an email to license@isb-sib.ch).



DR EMBL; X75285; CAA53040.1; -.
DR EMBL; AF135253; AAD34456.1; -.
DR EMBL; AF135239; AAD34456.1; JOINED.
DR EMBL; AF135240; AAD34456.1; JOINED.
DR EMBL; AF135241; AAD34456.1; JOINED.
DR EMBL; AF135242; AAD34456.1; JOINED.
DR EMBL; AF135243; AAD34456.1; JOINED.
DR EMBL; AF135244; AAD34456.1; JOINED.
DR EMBL; AF135245; AAD34456.1; JOINED.
DR EMBL; AF135246; AAD34456.1; JOINED.
DR EMBL; AF135247; AAD34456.1; JOINED.
DR EMBL; AF135248; AAD34456.1; JOINED.
DR EMBL; AF135249; AAD34456.1; JOINED.
DR EMBL; AF135250; AAD34456.1; JOINED.
DR EMBL; AF135251; AAD34456.1; JOINED.
DR EMBL; AF135252; AAD34456.1; JOINED.
DR PIR; A49457; A49457.
DR HSSP; P00736; 1APQ.
DR MGD; MGI:95488; Fbln2.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 2.
DR Pfam; PF00008; EGF; 6.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00179; EGF_CA; 9.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.



DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS01187; EGF_CA; 10.
KW Signal; Glycoprotein; Extracellular matrix; Plasma; EGF-like domain;
KW Calcium-binding; Alternative splicing; Repeat.

Query Match 30.3%; Score 729.5; DB 1; Length 1221;
Best Local Similarity 35.3%; Pred. No. 9e-47;
Matches 145; Conservative 59; Mismatches 148; Indels 59; Gaps 11;

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

= 1 HI I I hlhll III hi I II 
Db 832 RCMDGF-LQDPEGNCVDINECTSLLEPCRSGFSCINTVGSYTC 873 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119

Ihl II U ::||||:|| I hi hi 
Db 874 QRNPLVCGRGYHANEEGSECVDVNECETGVHRCGEGQLC 912 

Qy 120 INTEGGYTCSCTDGYW- -LLEGQCLDIDECRYG YCQQLCANVPGSYSCTCNPGFTLN 174 

I I I I I I : hh:M II I I I I I I hi II I 

Db 913 YNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGFLLA 972 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

11= I I | | | | | |||: | | ||:| Ml | : | : | | | : : I 
Db 973 ADGKHCEDVNECETRR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 1031

Qy 234 QHECVNQPGTYFCSCP - PGYI LLDDNRSCQDI NECEHRNHTCNLQQTCYNLQGGFKCI DP 292 

Ml Ihl hll II - : NhhUI I h Hhhll hh 

Db 1032 TFRCVNVPGSYQCACPEQGYTMMANGRSCKDLDECALGTHNCSEAETCHNIQGSFRCL-R 1090 

Qy 2 93 I RCEEPYLRI SDNRCMCPAENPGCRD QPFTI LYRDMDWSGRSVPADI FQMQAT 346 

I hhl :| I hi | | : :: :| Ml ||:: 

Db 1091 FDCPPNYVRVSQTKC ERTTCQDI TECQTS PARI THYQLNFQTGLLVPAHI FRI GPA 1146 

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397 

= 1 I III I h ■ : : | : . | | : | |: | | 

Db 1147 PAFAGDTISLTITKGNEEGYFVTRRLNAYTGWSLQRSVLEPRDFALDVEM 1197 
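
The summary figures printed above each alignment can be cross-checked arithmetically. The following is a minimal sketch, not the search program's own code. It assumes (the report does not say so explicitly) that Query Match is the hit score divided by the perfect score of 2407 given in the report header, and that Best Local Similarity is the number of matches divided by the number of alignment columns (matches + conservative + mismatches + indels). The function names are hypothetical.

```python
# Perfect score as printed in the report header (assumed to be the
# self-alignment score of the query).
PERFECT_SCORE = 2407


def query_match_pct(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Figures taken from the summary of the hit with Score 729.5.
    print(query_match_pct(729.5))
    print(best_local_similarity_pct(145, 59, 148, 59))
```

Under these assumptions the hit with Score 729.5 gives 30.3% and 35.3%, which agrees with the values printed in its summary.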



RESULT 10 
FBL1_CERAE 

ID FBL1_CERAE STANDARD; PRT; 598 AA. 

AC Q8MJJ9;
DT 15-SEP-2003 (Rel. 42, Created)
DT 15-SEP-2003 (Rel. 42, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-1C (Fragment).
GN FBLN1.
OS Cercopithecus aethiops (Green monkey) (Grivet).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
OC Cercopithecinae; Cercopithecus.
OX NCBI_TaxID=9534;
RN [1]
RP SEQUENCE FROM N.A., AND INTERACTION WITH DTR.
RX PubMed=11846885;
RA Brooke J.S., Cha J.-H., Eidels L.;
RT "Latent transforming growth factor beta-binding protein-3 and
RT fibulin-1C interact with the extracellular domain of the
RT heparin-binding EGF-like growth factor precursor.";
RL BMC Cell Biol. 3:2-2(2002).

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.
CC May play a role in cell adhesion and migration along protein
CC fibers within the extracellular matrix (ECM). Could be important
CC for certain developmental processes and contribute to the
CC supramolecular organization of ECM architecture, in particular to
CC those of basement membranes. May serve to anchor the
CC mature/soluble form of DTR to its fibers as it migrates through
CC the extracellular matrix. The direct physical association with DTR
CC may be useful in such tissue developmental processes as wound
CC healing.
CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components (By similarity). Interacts with the
CC mature/soluble form of DTR.
CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.
CC -!- SIMILARITY: Belongs to the fibulin family.
CC -!- SIMILARITY: Contains at least 2 anaphylatoxin-like domains.
CC -!- SIMILARITY: Contains 9 EGF-like domains.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF395659; AAM90567.1;
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR006210; IEGF.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF00008; EGF; 5.
DR SMART; SM00104; ANATO; 1.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 9.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 7.
KW Glycoprotein; Extracellular matrix; Repeat; EGF-like domain;
KW Calcium-binding.






FT   NON_TER       1      1
FT   CHAIN        <1    598       FIBULIN-1C.
FT   DOMAIN       <1     27       ANAPHYLATOXIN-LIKE 2.
FT   DOMAIN       28     60       ANAPHYLATOXIN-LIKE 3.
FT   DOMAIN       92    131       EGF-LIKE 1.
FT   DOMAIN      132    177       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      178    223       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      224    270       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      271    313       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      314    355       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      356    395       EGF-LIKE 7, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      396    439       EGF-LIKE 8, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      440    484       EGF-LIKE 9, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      271    355       SELF-ASSOCIATION AND FN1-BINDING (BY
FT                                SIMILARITY).
FT   DISULFID      ?     25       BY SIMILARITY.
FT   DISULFID      7     26       BY SIMILARITY.
FT   DISULFID     28     52       BY SIMILARITY.
FT   DISULFID     29     59       BY SIMILARITY.
FT   DISULFID      ?     60       BY SIMILARITY.
FT   DISULFID      ?    106       BY SIMILARITY.
FT   DISULFID    102    115       BY SIMILARITY.
FT   DISULFID    117    130       BY SIMILARITY.
FT   DISULFID    136    149       BY SIMILARITY.
FT   DISULFID      ?    158       BY SIMILARITY.
FT   DISULFID    164    176       BY SIMILARITY.
FT   DISULFID      ?    195       BY SIMILARITY.
FT   DISULFID      ?    204       BY SIMILARITY.
FT   DISULFID      ?    222       BY SIMILARITY.
FT   DISULFID      ?    242       BY SIMILARITY.
FT   DISULFID      ?    270       BY SIMILARITY.
FT   DISULFID      ?    288       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID    341      ?       BY SIMILARITY.
FT   DISULFID    360    369       BY SIMILARITY.
FT   DISULFID    365    378       BY SIMILARITY.
FT   DISULFID    380    394       BY SIMILARITY.
FT   DISULFID    400    413       BY SIMILARITY.
FT   DISULFID    409    422       BY SIMILARITY.
FT   DISULFID    424    438       BY SIMILARITY.
FT   DISULFID    444    457       BY SIMILARITY.
FT   DISULFID    451    466       BY SIMILARITY.
FT   DISULFID    471    483       BY SIMILARITY.
FT   CARBOHYD     14     14       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    450    450       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    454    454       N-LINKED (GLCNAC...) (POTENTIAL).
SQ   SEQUENCE   598 AA;  65516 MW;  849BF018DF452B02 CRC64;


Query Match 30.0%; Score 721; DB 1; Length 598;
Best Local Similarity 35.3%; Pred. No. 1.9e-46;
Matches 155; Conservative 72; Mismatches 184; Indels 28; Gaps 15;



Qy 2 CTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PR TNPVYRGPYSNP 58 

I l = :| : I 111 = 1 : I I =1 I I = I h I = = | 

Db 164 CGTGYEL-TEDNSCKDIDQCESGIHNCLPDFICQNTLGSFRCRPKLQCKNGFIQDALANC 222 

Qy 59 YS TPYSGP YPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQC 113 

= 111 I I II ::| Ihlll = I 

Db 223 I DINECLS I VSAPCPTGHTCINTEGSYTQKNVPNCGRGYHLNEEGTRCDVNECAPPAEPC 282 

Qy 114 N PTQ I C I NTEGG YTCS CTDG YWL - - LEGQCLD I DEC - RY - - G YCQQLCANVPGS YS CTCN 168 

hh hll Ih = h|::|| II III III |:|: 

Db 283 GKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYVCSCS 342 

Qy 169 PGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECS 227 

M I M-'-: :|| | | | |||: | | |hl = III I hllh 
Db 343 VGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECA 401 

Qy 228 F - - SEFLCQHECVNQPGTYFCSCP - PGYI LLDDNRSCQDI NECEHRNHTCNLQQTCYNLQ 284 

U : hi Ih: I I II II I : hlllhll I h: :|hhl 

Db 402 LPTGGHICSYRCINI PGSFQCSCPASGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQ 461 

Qy 285 GGFKCI DPI RCEEPYLRI SDNRC-MCPA- ENPGCRDQPFTI LYRDMDWSGRSVPADI FQ 342 

Mhh II f I : II I || | |||: : II :| = 

Db 462 GGFRCL-AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPAWFR 520 

Qy 343 MQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGPREI QLDLEM - - I TV 400 

I - II I III I h I I : :hh Ih: I : = | 

Db 521 MGPSSAVPGDSMQLAI TGGNEEGFFTTRKVS PHSGWALTKPVPEPRDLLLTVKMDLYRH 580 

Qy 401 NTVI NFRGS S VI RLR I YVS 419 

II H I H hll 

Db 581 GTVSSF VAKLFIFVS 595 
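
Each feature-table (FT) row in the entries carries a feature key, a start position, an end position, and an optional description. The following is a minimal sketch of how such a row could be split into fields, assuming whitespace-separated columns as they appear in this report rather than the fixed column offsets of the original flat file. `Feature`, `FT_LINE`, and `parse_ft_line` are hypothetical names. Positions are kept as strings so that open-ended values such as `<1` survive unchanged.

```python
import re
from typing import NamedTuple, Optional


class Feature(NamedTuple):
    key: str          # e.g. DOMAIN, DISULFID, CARBOHYD
    start: str        # kept as text so "<1" is preserved
    end: str
    description: str  # empty when the row has no description


# FT, key, start, end, then an optional free-text description.
FT_LINE = re.compile(r"^FT\s+(\S+)\s+(\S+)\s+(\S+)(?:\s+(.*))?$")


def parse_ft_line(line: str) -> Optional[Feature]:
    """Return a Feature for a primary FT row, or None for anything else
    (other line codes, or FT continuation rows that carry only text)."""
    match = FT_LINE.match(line.rstrip())
    if match is None:
        return None
    key, start, end, description = match.groups()
    return Feature(key, start, end, description or "")


if __name__ == "__main__":
    print(parse_ft_line("FT   DOMAIN       92    131       EGF-LIKE 1."))
    print(parse_ft_line("FT   CHAIN        <1    598       FIBULIN-1C."))
```

A continuation row that contains a single token after `FT` returns `None` here; a fuller parser would append such text to the previous feature's description.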

RESULT 11 
FBL1_CHICK 

ID FBL1_CHICK STANDARD; PRT; 704 AA. 

AC O73775; O73774;
DT 16-OCT-2001 (Rel. 40, Created)
DT 15-SEP-2003 (Rel. 42, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-1 precursor.
GN FBLN1.
OS Gallus gallus (Chicken).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;
OC Gallus.
OX NCBI_TaxID=9031;
RN [1]
RP SEQUENCE FROM N.A. (ISOFORMS C AND D).
RC TISSUE=Embryo;
RX MEDLINE=99120531; PubMed=9923656;
RA Barth J.L., Argraves K.M., Roark E.F., Little C.D., Argraves W.S.;
RT "Identification of chicken and C. elegans fibulin-1 homologs and
RT characterization of the C. elegans fibulin-1 gene.";
RL Matrix Biol. 17:635-646(1998).



CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.
CC May play a role in cell adhesion and migration along protein
CC fibers within the extracellular matrix (ECM). Could be important
CC for certain developmental processes and contribute to the
CC supramolecular organization of ECM architecture, in particular to
CC those of basement membranes.
CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components (By similarity).
CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=D;
CC IsoId=O73775-2; Sequence=Displayed;
CC Name=C;
CC IsoId=O73775-1; Sequence=VSP_007378;
CC -!- SIMILARITY: Belongs to the fibulin family.
CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.
CC -!- SIMILARITY: Contains 9 EGF-like domains.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF051399; AAC05387.1; -.
DR EMBL; AF051400; AAC05388.1;
DR HSSP; P00742; 1HCG.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 2.
DR Pfam; PF00008; EGF; 6.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 8.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 2.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 8.
KW Signal; Alternative splicing; Glycoprotein; Extracellular matrix;



KW Repeat; EGF-like domain
FT   SIGNAL        1     25       POTENTIAL.
FT   CHAIN        26    704       FIBULIN-1.
FT   DOMAIN       33     74       ANAPHYLATOXIN-LIKE 1.
FT   DOMAIN       75    109       ANAPHYLATOXIN-LIKE 2.
FT   DOMAIN      110    142       ANAPHYLATOXIN-LIKE 3.
FT   DOMAIN      177    216       EGF-LIKE 1.
FT   DOMAIN      217    262       EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      263    308       EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      309    356       EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      357    399       EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      400    441       EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      442    481       EGF-LIKE 7, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      482    525       EGF-LIKE 8, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      526    579       EGF-LIKE 9, CALCIUM-BINDING (POTENTIAL).
FT   DOMAIN      357    441       SELF-ASSOCIATION AND FN1-BINDING (BY
FT                                SIMILARITY).
FT   DISULFID     33     59       BY SIMILARITY.
FT   DISULFID     34     66       BY SIMILARITY.
FT   DISULFID     47     67       BY SIMILARITY.
FT   DISULFID     76    107       BY SIMILARITY.
FT   DISULFID     89    108       BY SIMILARITY.
FT   DISULFID    110    134       BY SIMILARITY.
FT   DISULFID    111    141       BY SIMILARITY.
FT   DISULFID    124    142       BY SIMILARITY.
FT   DISULFID    181    191       BY SIMILARITY.
FT   DISULFID    187    200       BY SIMILARITY.
FT   DISULFID    202    215       BY SIMILARITY.
FT   DISULFID    221    234       BY SIMILARITY.
FT   DISULFID    228    243       BY SIMILARITY.
FT   DISULFID    249    261       BY SIMILARITY.
FT   DISULFID    267    280       BY SIMILARITY.
FT   DISULFID    274    289       BY SIMILARITY.
FT   DISULFID    295    307       BY SIMILARITY.
FT   DISULFID    313      ?       BY SIMILARITY.
FT   DISULFID    320    335       BY SIMILARITY.
FT   DISULFID    342    355       BY SIMILARITY.
FT   DISULFID    361    374       BY SIMILARITY.
FT   DISULFID    368    383       BY SIMILARITY.
FT   DISULFID    385    398       BY SIMILARITY.
FT   DISULFID    404    416       BY SIMILARITY.
FT   DISULFID    412      ?       BY SIMILARITY.
FT   DISULFID    427    440       BY SIMILARITY.
FT   DISULFID    446      ?       BY SIMILARITY.
FT   DISULFID      ?    464       BY SIMILARITY.
FT   DISULFID    466      ?       BY SIMILARITY.
FT   DISULFID    486    499       BY SIMILARITY.
FT   DISULFID    495    508       BY SIMILARITY.
FT   DISULFID    510    524       BY SIMILARITY.
FT   DISULFID    530    543       BY SIMILARITY.
FT   DISULFID    537    552       BY SIMILARITY.
FT   DISULFID    557    578       BY SIMILARITY.
FT   CARBOHYD     96     96       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    536    536       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    540    540       N-LINKED (GLCNAC...) (POTENTIAL).


FT   VARSPLIC    568    704       VRLEKTDTIRCIKSCRPNDVNCVLDPVHTISHTVISLPTFR
FT                                EFTRPEEIIFLRAITPTYPANQADIIFDITEGNLRESFDII
FT                                KRYMDGMTVGVVRQVRPIVGPFHAILKLEMNYVMGGVVSHR
FT                                NIVNVHIFVSEYWF -> RCERLPCNENKECQSLPLRITYY
FT                                HLSFPTNIQVPTDIFRMGPSNAVPGDKILLSIISGNQEGFF
FT                                TTKKVNNHSGIWMQRQITEPRDLLLTIQMQLTRHGTVNTF
FT                                IAKLFVFVSAQL (in isoform C).
FT                                /FTId=VSP_007378.
SQ   SEQUENCE   704 AA;  78137 MW;  D47D5A30D5E42932 CRC64;



Query Match 29.6%; Score 713; DB 1; Length 704;

Best Local Similarity 35.1%; Pred. No. 8.8e-46; 

Matches 160; Conservative 64; Mismatches 154; Indels 78; Gaps 21; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

II III I I hlhll : I I :| I I I :| 
Db 294 QCMNGFI QD-ALGNCI DI NECLSTNMPCPAGQI CI NTDGS YTC-QRI SP 340 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDE-SNQCVDVDECATDSHQCNPTQIC 119

I II ::| :|lllllh: I = | 

Db 341 -SCGRGYHLNEDGTRCVDVDECSSSDQPCGEGHVC 374 

Qy 12 0 I NTEGG YTCS CTDG YW - - LLEGQCLD I DE CRY GYCQQLCANVPGSYSCTCNPGFTLN 174 

M I I I I II hlhlll I I I I I I I III II 

Db 375 INGPGNYRCECKSGYSFDVISRTCIDINECRRYPGRLCAHKCENTPGSYYCTCTMGFKLS 434 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF- -SEF 231 

llllhhlll •'Mill llh I I I : lh I hllh 

Db 435 SDGRSCEDLNECES-SPCSQECANVYGSYQCYCRRGFQLSDIDGISCEDIDECALPTGGH 493 

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

= 1 hi M- 1=11 M I = hlllhll I h :||:|:|||hh 

Db 494 I CSFRCI N I PGS FQCTCPSTGYRLAPNARNCQDI DECVAETHNCS FNETCFN I QGGFRCL 553 

Qy 291 DPIRCEEPYLRISDN RCM- -CPAENPGC-RDQPFTILYRDMDWSGRSV- - 336 

: I I I : I lh | : I I || : : : : | 

Db 554 S - LECPENYRKSGDTVRLEKTDTI RCI KSCRPNDVNCVLDPVHTI SHTVI SLPTFREFTR 612 

Qy 337 PADI FQMQA-TTRYPG- -AYYI FQI KSGNEGREF YMRQTG P I SATLVMTR P I KG P 388 

I : I : : I I II I II I II I II I : Ml || 

Db 613 PEEI IFLRAITPTYPANQADI IFDITEGNLRESFDI IKRYM- -DGMTVGWRQVRPIVGP 670 

Qy 389 RE I QLDLEM - I TVNTVI NFRGSSVI RLRI YVSQYPF 423 

I III : h= I : hlhl I 

Db 671 FHA I LKLEMN YVMGG WSHR - - N I VNVH I FVS E YWF 704 
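
The accession numbers on the AC lines of these entries (for example Q8MJJ9 or P98095) follow a fixed six-character pattern, which makes some OCR damage detectable. The following is a minimal sketch of such a check. The pattern used, one of the letters O, P or Q, a digit, three alphanumeric characters, and a final digit, is the classic Swiss-Prot accession layout stated here from general knowledge of the format, not from this report, so treat it as an assumption. `looks_like_accession` is a hypothetical helper name.

```python
import re

# Assumed classic six-character accession layout: [OPQ] digit, three
# letters or digits, digit. Later database releases allow other forms.
ACCESSION = re.compile(r"[OPQ][0-9][A-Z0-9]{3}[0-9]")


def looks_like_accession(token: str) -> bool:
    """True when the whole token matches the assumed accession layout."""
    return ACCESSION.fullmatch(token) is not None


if __name__ == "__main__":
    for token in ("Q8MJJ9", "P98095", "O73775", "073775"):
        print(token, looks_like_accession(token))
```

A token in which the leading letter O has been read as the digit zero fails the check, which is a hint that the original character should be verified against the database record.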

RESULT 12 
FBL2_HUMAN 

ID FBL2_HUMAN STANDARD; PRT; 1184 AA. 

AC P98095; 

DT 01-NOV-1995 (Rel. 32, Created)
DT 01-NOV-1995 (Rel. 32, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Fibulin-2 precursor.
GN FBLN2.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Fibroblast;
RX MEDLINE=95104855; PubMed=7806230;
RA Zhang R.-Z., Pan T.-C., Zhang Z.-Y., Mattei M.-G., Timpl R.,
RA Chu M.-L.;
RT "Fibulin-2 (FBLN2): human cDNA sequence, mRNA expression, and mapping
RT of the gene on human and mouse chromosomes.";
RL Genomics 22:425-430(1994).
RN [2]
RP DEVELOPMENTAL STAGE.
RX PubMed=8737292;
RA Miosge N., Gotz W., Sasaki T., Chu M.-L., Timpl R., Herken R.;
RT "The extracellular matrix proteins fibulin-1 and fibulin-2 in the
RT early human embryo.";
RL Histochem. J. 28:109-116(1996).

CC -!- FUNCTION: ITS BINDING TO FIBRONECTIN AND SOME OTHER LIGANDS IS
CC CALCIUM DEPENDENT.
CC -!- SUBUNIT: Homotrimer; disulfide-linked. Interacts with LAMA2 (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix.
CC -!- TISSUE SPECIFICITY: Component of both basement membranes and other
CC connective tissues. Expressed in heart, placenta and ovary.
CC -!- DEVELOPMENTAL STAGE: Widely expressed during embryonic
CC development. Primarily detected within the neuroepithelium, spinal
CC ganglia and peripheral nerves.
CC -!- SIMILARITY: Belongs to the fibulin family.
CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.
CC -!- SIMILARITY: Contains 11 EGF-like domains.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC

DR EMBL; X82494; CAA57876.1; -.
DR PIR; A55184; A55184.
DR HSSP; P00736; 1APQ.
DR Genew; HGNC:3601; FBLN2.
DR MIM; 135821; -.
DR GO; GO:0005578; C:extracellular matrix; TAS.
DR GO; GO:0005509; F:calcium ion binding activity; TAS.
DR GO; GO:0005207; F:extracellular matrix glycoprotein; TAS.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 2.
DR Pfam; PF00008; EGF; 7.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00179; EGF_CA; 9.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.
DR PROSITE; PS00022; EGF_1; FALSE_NEG.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS01187; EGF_CA; 9.
KW Signal; Glycoprotein; Extracellular matrix; Plasma; EGF-like domain;
KW Calcium-binding; Repeat.
FT   SIGNAL        1     27       POTENTIAL.
FT   CHAIN        28   1184       FIBULIN-2.
FT   DOMAIN       28    444       N.
FT   DOMAIN       28    177       SUBDOMAIN NA (CYS-RICH).
FT   DOMAIN      178    444       SUBDOMAIN NB (CYS-FREE).



FT   DOMAIN        ?      ?       ANAPHYLATOXIN-LIKE 1.
FT   DOMAIN        ?    519       ANAPHYLATOXIN-LIKE 2.
FT   DOMAIN      521    553       ANAPHYLATOXIN-LIKE 3.
FT   DOMAIN        ?    645       EGF-LIKE 1, CALCIUM-BINDING.
FT   DOMAIN      679    718       EGF-LIKE 2.
FT   DOMAIN      719    763       EGF-LIKE 3, CALCIUM-BINDING.
FT   DOMAIN      764    809       EGF-LIKE 4, CALCIUM-BINDING.
FT   DOMAIN      810    857       EGF-LIKE 5, CALCIUM-BINDING.
FT   DOMAIN      858    900       EGF-LIKE 6, CALCIUM-BINDING.
FT   DOMAIN      901    949       EGF-LIKE 7, CALCIUM-BINDING.
FT   DOMAIN      947    981       EGF-LIKE 8, CALCIUM-BINDING.
FT   DOMAIN      982   1024       EGF-LIKE 9, CALCIUM-BINDING.
FT   DOMAIN     1025   1069       EGF-LIKE 10, CALCIUM-BINDING.
FT   DOMAIN     1070   1184       DOMAIN III.




FT   DISULFID      ?    479       BY SIMILARITY.
FT   DISULFID      ?    479       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID    489    518       BY SIMILARITY.
FT   DISULFID    502    519       BY SIMILARITY.
FT   DISULFID    521    545       BY SIMILARITY.
FT   DISULFID    522    559       BY SIMILARITY.
FT   DISULFID    535    553       BY SIMILARITY.
FT   DISULFID    608    620       BY SIMILARITY.
FT   DISULFID      ?    699       BY SIMILARITY.
FT   DISULFID    671    644       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?    709       BY SIMILARITY.
FT   DISULFID    704    717       BY SIMILARITY.
FT   DISULFID    797    776       BY SIMILARITY.
FT   DISULFID      ?    745       BY SIMILARITY.
FT   DISULFID      ?    769       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?    790       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID    862      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?      ?       BY SIMILARITY.
FT   DISULFID      ?    917       BY SIMILARITY.
FT   DISULFID    917    996       BY SIMILARITY.
FT   DISULFID      ?    941       BY SIMILARITY.
FT   DISULFID    947    956       BY SIMILARITY.
FT   DISULFID      ?    965       BY SIMILARITY.
FT   DISULFID    967      ?       BY SIMILARITY.
FT   DISULFID    986      ?       BY SIMILARITY.
FT   DISULFID    994   1007       BY SIMILARITY.
FT   DISULFID      ?   1097       BY SIMILARITY.
FT   DISULFID   1029   1042       BY SIMILARITY.
FT   DISULFID   1036   1051       BY SIMILARITY.
FT   DISULFID   1056   1068       BY SIMILARITY.
FT   CARBOHYD    180    180       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD    507    507       N-LINKED (GLCNAC...) (POTENTIAL).
FT   CARBOHYD   1035   1035       N-LINKED (GLCNAC...) (POTENTIAL).
SQ   SEQUENCE   1184 AA;  126543 MW;  CA48490A55F9EC5D CRC64;



Query Match 29.1%; Score 701.5; DB 1; Length 1184;

Best Local Similarity 34.5%; Pred. No. 1e-44;

Matches 142; Conservative 58; Mismatches 152; Indels 59; Gaps 11; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

= 1 HI I I hlhll :: II hi I I I 
Db 795 RCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC 836 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVDECATDSHQCNPTQIC 119 

I I I I II h :|llhll I hi hi 
Db 837 QRNPLICARGYHASDDGAKCVDVNECETGVHRCGEGQVC 875 

Qy 120 I NTEGG YTCS CTDG YW - - LLEGQCLD I DECRYG YCQQLCAN VPGS YS CTCN PGFTLN 174 

I I I I I h II I I III hi 1 1 I 

Db 876 HNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENTLGSYRCSCASGFLLA 935

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

Ih 1 = 11111 : I I I I Mh I I Ihl III I : I : I I I : : II 
Db 936 ADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 994 

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

hi Ihl hll II = : llhh:M I h Hhhll hh 

Db 995 TFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGSFRCL-R 1053 

Qy 293 I RCEEPYLRI SDNRCMCPAENPGCRD QPFTI LYRDMDWSGRSVPADI FQMQAT 346 

I h-l :| III | | : :: :| Ml ||:: 

Db 1054 FECPPNYVQVSKTKC ERTTCHDFLECQNS PAR I TH YQLNFQTGLL VPAH I FR I G PA 1109 

Qy 347 TRYPGAYYI FQI KSGNEGREFYMRQTGPI SATLVMTRPI KGPREIQLDLEM 397 

: I II I I = :.: | : ||: ||:|| 

Db 1110 PAFTGDTI ALNI I KGNEEGYFGTRRLNAYTGWYLQRAVLEPRDFALDVEM 1160 
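
The database cross-reference (DR) lines in the entries are semicolon-separated: a database name followed by one or more identifiers, with a closing period. The following is a minimal sketch of splitting such a line, assuming the line code `DR` is present at the start of the line and that only the final period is a terminator. `parse_dr_line` is a hypothetical helper name, not part of any library.

```python
from typing import List, Optional, Tuple


def parse_dr_line(line: str) -> Optional[Tuple[str, List[str]]]:
    """Split a DR line into (database, [fields...]).

    Returns None when the line is not a DR line or carries no fields.
    """
    if not line.startswith("DR"):
        return None
    body = line[2:].strip()
    # Drop only the terminating period; dots inside identifiers such as
    # "CAA57876.1" are left untouched.
    if body.endswith("."):
        body = body[:-1]
    fields = [field.strip() for field in body.split(";") if field.strip()]
    if not fields:
        return None
    return fields[0], fields[1:]


if __name__ == "__main__":
    print(parse_dr_line("DR   PROSITE; PS01186; EGF_2; 5."))
    print(parse_dr_line("DR   EMBL; X82494; CAA57876.1; -."))
```

Grouping the returned tuples by database name gives a quick way to compare, for instance, the PROSITE hit counts reported for different fibulin entries.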



RESULT 13 
FBL1 HUMAN 



ID FBL1_HUMAN STANDARD; PRT; 703 AA. 

AC P23142; P23143; P23144; P37888; Q8TBH8; Q9HBQ5; Q9UGR4 ; Q9UH41; 

DT 01-NOV-1991 (Rel. 20, Created) 

DT 15-SEP-2003 (Rel. 42, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibulin-1 precursor. 

GN FBLN1 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS A; B AND C).

RX MEDLINE=91100426; PubMed=2269669;

RA Argraves W.S., Tran H., Burgess W.H., Dickerson K.;

RT "Fibulin is an extracellular matrix and plasma glycoprotein with 

RT repeated domain structure."; 

RL J. Cell Biol. 111:3155-3164(1990). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM D), TISSUE SPECIFICITY, AND INTERACTION

RP WITH FN1 AND FGB.

RX PubMed=9106159; 



RA Tran H., Mattei M., Godyna S., Argraves W.S.;

RT "Human fibulin-1D: molecular cloning, expression and similarity with

RT S1-5 protein, a new member of the fibulin gene family.";

RL Matrix Biol. 15:479-493(1997). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM D).

RX MEDLINE=99253993; PubMed=10318851;

RA Krichevsky A.M., Metzer E., Rosen H.;

RT "Translational control of specific genes during differentiation of

RT HL-60 cells.";

RL J. Biol. Chem. 274:14295-14305(1999). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM C).

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P.,

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y.,

RA Yu J., Han L.H.;

RT "Novel Human cDNA clones with function of inhibiting cancer cell

RT growth.";

RL Submitted (DEC-1999) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20057165; PubMed=10591208;

RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M.,

RA Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K.,

RA Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P.,

RA Bird C.P., Blakey S.E., Bridgeman A.M., Buck D., Burgess J.,

RA Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G.,

RA Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R.,

RA Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E.,

RA Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G.,

RA Evans K.L., Fey J.M., Fleming K., French L., Garner A.A.,

RA Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C.,

RA Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S.,

RA Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A.,

RA Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M.,

RA Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T.,

RA Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J.,

RA Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T.,

RA Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T.,

RA Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L.,

RA Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M.,

RA Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L.,

RA Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L.,

RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N.,

RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J.,

RA Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S.,

RA Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T.,

RA Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I.,

RA Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H.,

RA Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L.,

RA Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z.,

RA Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P.,

RA Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S.,

RA Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J.,

RA Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T.,

RA Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J.,

RA Korf I., Bedell J.A., Hillier L., Mardis E., Waterston R., Wilson R.,

RA Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., Budarf M.L.,

RA McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., Edelmann L.,

RA Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., Peyrard M., Kedra D.,

RA Seroussi E., Fransson I., Tapia I., Bruder C.E., O'Brien K.P.,

RA Wilkinson P., Bodenteich A., Hartman K., Hu X., Khan A.S., Lane L.,

RA Tilahun Y., Wright H.;

RT "The DNA sequence of human chromosome 22."; 

RL Nature 402:489-495(1999). 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM C).

RC TISSUE=Brain;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [7] 

RP SEQUENCE OF 1-26 FROM N.A.

RX PubMed=11829738;

RA Castoldi M., Chu M.-L.;

RT "Structural and functional characterization of the human and mouse

RT fibulin-1 gene promoters: role of Sp1 and Sp3.";

RL Biochem. J. 362:41-50(2002). 

RN [8] 

RP SEQUENCE OF 30-44. 

RX MEDLINE=89354537; PubMed=2527614;

RA Argraves W.S., Dickerson K., Burgess W.H., Ruoslahti E.;

RT "Fibulin, a novel protein that interacts with the fibronectin

RT receptor beta subunit cytoplasmic domain.";

RL Cell 58:623-629(1989). 

RN [9] 

RP SELF-ASSOCIATION AND INTERACTION WITH FN1.

RX PubMed=1400330;

RA Balbona K., Tran H., Godyna S., Ingham K.C., Strickland D.K.,

RA Argraves W.S.;

RT "Fibulin binds to itself and to the carboxyl-terminal heparin-binding

RT region of fibronectin."; 

RL J. Biol. Chem. 267:20120-20125(1992). 

RN [10] 

RP POSSIBLE FUNCTION. 

RX PubMed=7534784; 



RA Roark E.F., Keene D.R., Haudenschild C.C., Godyna S., Little C.D.,

RA Argraves W.S.;

RT "The association of human fibulin-1 with elastic fibers: an

RT immunohistological, ultrastructural, and RNA study.";

RL J. Histochem. Cytochem. 43:401-411(1995). 

RN [11] 

RP INTERACTION WITH FGB.

RX PubMed=7642629;

RA Tran H., Tanaka A., Litvinovich S.V., Medved L.V., Haudenschild C.C.,

RA Argraves W.S.;

RT "The interaction of fibulin-1 with fibrinogen. A potential role in 

RT hemostasis and thrombosis."; 

RL J. Biol. Chem. 270:19458-19464(1995). 

RN [12] 

RP DEVELOPMENTAL STAGE. 

RX PubMed=8737292; 

RA Miosge N., Gotz W., Sasaki T., Chu M.-L., Timpl R., Herken R.;

RT "The extracellular matrix proteins fibulin-1 and fibulin-2 in the

RT early human embryo.";

RL Histochem. J. 28:109-116(1996). 

RN [13] 

RP INDUCTION. 

RX MEDLINE=96133928; PubMed=8552629;

RA Clinton G.M., Rougeot C., Derancourt J., Roger P., Defrenne A.,

RA Godyna S., Argraves W.S., Rochefort H.;

RT "Estrogens increase the expression of fibulin-1, an extracellular

RT matrix protein secreted by human ovarian cancer cells.";

RL Proc. Natl. Acad. Sci. U.S.A. 93:316-320(1996).

RN [14] 

RP CALCIUM, SELF-ASSOCIATION, AND FN1-BINDING SITES.

RX PubMed=9278415;

RA Tran H., VanDusen W.J., Argraves W.S.;

RT "The self-association and fibronectin-binding sites of fibulin-1 map

RT to calcium-binding epidermal growth factor-like domains."; 

RL J. Biol. Chem. 272:22600-22606(1997). 

RN [15] 

RP ROLE IN TUMOR FORMATION AND INVASION. 

RX PubMed=9393974;

RA Qing J., Maher V.M., Tran H., Argraves W.S., Dunstan R.W.,

RA McCormick J.J.;

RT "Suppression of anchorage-independent growth and matrigel invasion and

RT delayed tumor formation by elevated expression of fibulin-1D in human

RT fibrosarcoma-derived cell lines.";

RL Oncogene 15:2159-2168(1997). 

RN [16] 

RP INDUCTION. 

RX PubMed=9811350;

RA Roger P., Pujol P., Lucas A., Baldet P., Rochefort H.;

RT "Increased immunostaining of fibulin-1, an estrogen-regulated protein 

RT in the stroma of human ovarian epithelial tumors."; 

RL Am. J. Pathol. 153:1579-1588(1998). 

RN [17] 

RP ROLE IN TUMOR FORMATION AND INVASION.

RX PubMed=9466671;

RA Hayashido Y., Lucas A., Rougeot C., Godyna S., Argraves W.S.,

RA Rochefort H.;

RT "Estradiol and fibulin-1 inhibit motility of human ovarian- and

RT breast-cancer cells induced by fibronectin.";

RL Int. J. Cancer 75:654-658(1998). 

RN [18] 

RP INTERACTION WITH NOV. 

RX PubMed=9927660; 

RA Perbal B., Martinerie C., Sainson R., Werner M., He B., Roizman B.;

RT "The C-terminal domain of the regulatory protein NOVH is sufficient to 

Query Match 29.1%; Score 700; DB 1; Length 703;

Best Local Similarity 34.7%; Pred. No. 8.1e-45; 

Matches 160; Conservative 65; Mismatches 148; Indels 88; Gaps 21; 

QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60
II - I I hlhll : I I hi III 

QCKSGFIQD-ALGNCIDINECLSISAPCPIGHTCINTEGSYTC 334

TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119

: II I II : :| MINIMI : | | 
QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRC 373 

INTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCNPGFTLN 174

■\- I = I I lh : h II III Ml hh II h 

VNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYLCSCSVGFRLS 433 

EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231

MM II III |||: | | ||:| : Ml | II 

VDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 492
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= 1 : hi 11 = : I I I I II I : I MM I h: Hhhlllhh 

ICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGGFRCL 552

DPIRCEEPYLRISDNRCMCPAENPGCRDQPFTILYRDMDVVSGRSV 336

MM: lh II | : : : : | : 
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PADIFQMQA-TTRYPG- - AYYI FQI KSGNEGREF YMRQTGP I SATLVMTR 383 

I : I : : I I : I I II I II I || | : | 

REFTRPEEIIFLRAITPPHPASQANIIFDITEGNLRDSFDIIKRYM--DGMTVGVVRQVR 664
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RESULT 14 
FBL1_MOUSE

ID FBL1_MOUSE STANDARD; PRT; 705 AA.

AC Q08879; Q08878; Q8C3B1; Q91ZC9; Q922K8;

DT 01-OCT-1994 (Rel. 30, Created)

DT 15-SEP-2003 (Rel. 42, Last sequence update)

DT 15-SEP-2003 (Rel. 42, Last annotation update)

DE Fibulin-1 precursor (Basement-membrane protein 90) (BM-90).

GN FBLN1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.



OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS C AND D), AND LIGANDS INTERACTION.

RX MEDLINE=93358897; PubMed=8354280;

RA Pan T.-C., Kluge M., Zhang R.Z., Mayer U., Timpl R., Chu M.-L.;

RT "Sequence of extracellular mouse protein BM-90/fibulin and its

RT calcium-dependent binding to other basement-membrane ligands.";

RL Eur. J. Biochem. 215:733-740(1993). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM C) . 

RC STRAIN=C57BL/6J; TISSUE=Head, and Urinary bladder; 

RX MEDLINE=22354683; PubMed=12466851;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM C) . 

RC TISSUE=Breast tumor; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 1-26 FROM N.A.

RX PubMed=11829738;

RA Castoldi M., Chu M.-L.;

RT "Structural and functional characterization of the human and mouse

RT fibulin-1 gene promoters: role of Sp1 and Sp3.";

RL Biochem. J. 362:41-50(2002). 

RN [5] 

RP CHARACTERIZATION OF NID AFFINITY. 

RX PubMed=7844816; 

RA Sasaki T., Kostka G., Goehring W., Wiedemann H., Mann K., Chu M.-L.,

RA Timpl R.;

RT "Structural characterization of two variants of fibulin-1 that differ 

RT in nidogen affinity."; 

RL J. Mol. Biol. 245:241-250(1995). 

RN [6] 

RP DEVELOPMENTAL STAGE. 

RX PubMed=8850569; 

RA Zhang H.-Y., Timpl R., Sasaki T., Chu M.-L., Ekblom P.;

RT "Fibulin-1 and fibulin-2 expression during organogenesis in the 

RT developing mouse embryo."; 

RL Dev. Dyn. 205:348-364(1996). 

RN [7] 

RP NID-BINDING SITE. 

RC STRAIN=129/Sv; 

RX PubMed=9299350; 

RA Adam S., Goehring W., Wiedemann H., Chu M.-L., Timpl R., Kostka G.;

RT "Binding of fibulin-1 to nidogen depends on its C-terminal globular 

RT domain and a specific array of calcium-binding epidermal growth 

RT factor-like (EG) modules."; 

RL J. Mol. Biol. 272:226-236(1997). 

RN [8] 

RP BINDING TO LAMA2.

RX PubMed=10022829;

RA Talts J.F., Andac Z., Goehring W., Brancaccio A., Timpl R.;

RT "Binding of the G domains of laminin alpha1 and alpha2 chains and

RT perlecan to heparin, sulfatides, alpha-dystroglycan and several 

RT extracellular matrix proteins."; 

RL EMBO J. 18:863-870(1999). 

RN [9] 

RP INTERACTION WITH AGC1 AND CSPG2.

RX PubMed=10400671;

RA Aspberg A., Adam S., Kostka G., Timpl R., Heinegaard D.;

RT "Fibulin-1 is a ligand for the C-type lectin domains of aggrecan and

RT versican.";

RL J. Biol. Chem. 274:20444-20449(1999). 

RN [10] 

RP INTERACTION WITH NID. 

RX PubMed=11589703;

RA Ries A., Goehring W., Fox J.W., Timpl R., Sasaki T.;

RT "Recombinant domains of mouse nidogen-1 and their binding to basement

RT membrane proteins and monoclonal antibodies.";

RL Eur. J. Biochem. 268:5119-5128(2001).

RN [11]

RP DOWN-REGULATION BY GLUCOCORTICOIDS.

RX PubMed=11737251;

RA Gu Y.-C., Talts J.F., Gullberg D., Timpl R., Ekblom M.;

RT "Glucocorticoids down-regulate the extracellular matrix proteins 

RT fibronectin, fibulin-1 and fibulin-2 in bone marrow stroma."; 

RL Eur. J. Haematol. 67:176-184(2001). 

RN [12] 

RP TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE. 

RX PubMed=11238726; 

RA Ohsawa I., Takamura C., Kohsaka S.;

RT "Fibulin-1 binds the amino-terminal head of beta-amyloid precursor 

RT protein and modulates its physiological function."; 

RL J. Neurochem. 76:1411-1420(2001). 

RN [13] 

RP INTERACTION WITH E6, AND INHIBITION OF E6-MEDIATED TRANSFORMATION.

RX PubMed=12200142;

RA Du M., Fan X., Hong E., Chen J.J.;

RT "Interaction of oncogenic papillomavirus E6 proteins with fibulin-1.";

RL Biochem. Biophys. Res. Commun. 296:962-969(2002).

RN [14] 

RP DEVELOPMENTAL STAGE. 

RX PubMed=11836357; 

RA Debeer P., Schoenmakers E.F.P.M., Twal W.O., Argraves W.S., 

RA De Smet L., Fryns J.-P., Van De Ven W.J.M.;

RT "The fibulin-1 gene (FBLN1) is disrupted in a t(12;22) associated with

RT a complex type of synpolydactyly.";

RL J. Med. Genet. 39:98-104(2002). 

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the 

CC supramolecular organization of ECM architecture, in particular to 

CC those of basement membranes. 

CC -!- SUBUNIT: Interacts with itself and with various extracellular
CC matrix components such as FN1, LAMA1, LAMA2, NID, AGC1, CSPG2 and

CC type IV collagen. Interacts also with papillomavirus E6 proteins.

CC Binding analysis demonstrated for isoform C a 100-fold stronger

CC binding to the basement membrane protein NID than for isoform D.

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4;

CC Name=D;

CC IsoId=Q08879-1; Sequence=Displayed;

CC Name=A;

CC IsoId=Q08879-3; Sequence=Not described;

CC Name=B;

CC IsoId=Q08879-4; Sequence=Not described;

CC Name=C;

CC IsoId=Q08879-2; Sequence=VSP_001386;

CC Note=Conflict E -> A at position 571 of isoform C (Ref.1);

CC -!- TISSUE SPECIFICITY: Detected in most organs (brain, heart, lung,

CC spleen, liver and kidney). Neurons are the predominant source of



CC production in the brain. Not expressed significantly by astrocytes 

CC or microglia. 

CC -!- DEVELOPMENTAL STAGE: The differential expression of the fibulin 

CC family contributes to the formation of molecularly distinct 

CC extracellular matrices already during early developmental stages 

CC of a large number of tissues. Increase expression at neonate stage 

CC in the brain. Expressed in interdigital regions of the handplate 

CC of a 12 dpc embryo and in the lateral perichondrial region.

CC Similar expression persists in the 13 dpc handplate particularly 

CC in the perichondrial regions and apical aspects of the developing 

CC digits. 

CC -!- INDUCTION: Glucocorticoids suppressed mRNA expression and protein 



CC synthesis. 

CC -!- SIMILARITY: Belongs to the fibulin family.

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains.

CC -!- SIMILARITY: Contains 9 EGF-like domains.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

Query Match 29.0%; Score 699; DB 1; Length 705; 

Best Local Similarity 35.0%; Pred. No. 9.6e-45;

Matches 159; Conservative 67; Mismatches 154; Indels 74; Gaps 21; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

II HI I I hlhll : I I hi I I I 
Db 295 QCKSGFIQD-ALGNCIDINECLSISAPCPVGQTCINTEGSYTC 336 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119

I II : :| :||||||h : I I 
Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECSPPAEPCGKGHHC 375 

Qy 120 INTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCNPGFTLN 174

'I: I : I I h= : hlhll II III Mh hh II h 

Db 376 LNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFHCSCSAGFRLS 435

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231

I! II I I 'III |||: | | ||:| : ||| | |:|||: 

Db 436 VDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 494

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

: I = hi II- I I I I II I : hlllhll I I- Hhhll hh 

Db 495 ICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGSFRCL 554

Qy 291 DPIRCEEPYLRISDN RCM- -CPAENPGC-RDQPFTILYRDMDWSGRSV- - 336 

I M I M lh I : I II h : : : : | 

Db 555 S-FECPENYRRSADTFRQEKTDTVRCIKSCRPNDEACVRDPVHTVSHTVISLPTFREFTR 613

Qy 337 PADIFQMQATT-RYPG- -AYYI FQIKSGNEGREFYM RQTGPISATLVMTRPI KGPRE 390 

I U -I I II I II I II I = I : I I I || 

Db 614 PEEIIFLRAVTPLYPANQADIIFDITEGNLRDSFDIIKRYEDGMTVGVVRQVRPIVGPFY 673

Qy 391 IQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 423

I III : I- I :|: = hlhl I 
Db 674 AVLKLEMNYVLGGVVSHR--NVVNVHIFVSEYWF 705



RESULT 15 
FBL1_BRARE



ID FBL1_BRARE STANDARD; PRT; 681 AA. 

AC O42182; O42183;

DT 15-SEP-2003 (Rel. 42, Created) 

DT 15-SEP-2003 (Rel. 42, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Fibulin-1 precursor. 

GN FBLN1.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS C AND D).

RA Zhang H.-Y., Lardelli M., Ekblom P.;

RT "Sequence of zebrafish fibulin-1 and its expression in developing

RT heart and other embryonic organs.";

RL Dev. Genes Evol. 207:340-351(1997).

CC -!- FUNCTION: Incorporated into fibronectin-containing matrix fibers.

CC May play a role in cell adhesion and migration along protein

CC fibers within the extracellular matrix (ECM). Could be important

CC for certain developmental processes and contribute to the

CC supramolecular organization of ECM architecture, in particular to

CC those of basement membranes.

CC -!- SUBUNIT: Interacts with itself and with various extracellular

CC matrix components such as FN1, LAMA1, NID, AGC1 and CSPG2.

CC -!- SUBCELLULAR LOCATION: Secreted; extracellular matrix. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2;

CC Name=D;

CC IsoId=O42182-1; Sequence=Displayed;

CC Name=C;

CC IsoId=O42182-2; Sequence=VSP_007379;

CC -!- DEVELOPMENTAL STAGE: Isoform C is detected in the later blastula 

CC period, 4 h after fertilization. Isoform D is not detected at this 

CC stage, it first appears during the gastrula period in 8-h-old 

CC embryos. Expression of both isoforms is then maintained throughout 

CC development. During later developmental stages, prominent 

CC expression is seen in regions where tissue compartments are 

CC continuously moving in relation to each other. 

CC -!- SIMILARITY: Belongs to the fibulin family. 

CC -!- SIMILARITY: Contains 3 anaphylatoxin-like domains. 

CC -!- SIMILARITY: Contains 9 EGF-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF013751; AAB80944.1; -. 

DR EMBL; AF013752; AAB80945.1; -. 

DR HSSP; P35555; 1EMN.
DR ZFIN; ZDB-GENE-990415-73; fbln1.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001673; S_mold_repeat.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF00008; EGF; 4.
DR SMART; SM00104; ANATO; 2.
DR SMART; SM00179; EGF_CA; 5.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 6.
KW Signal; Alternative splicing; Glycoprotein; Extracellular matrix;
KW Repeat; EGF-like domain; Calcium-binding.
FT SIGNAL 1 17 POTENTIAL.
FT CHAIN [illegible] [illegible] FIBULIN-1.
FT DOMAIN 29 [illegible] ANAPHYLATOXIN-LIKE 1.
FT DOMAIN 68 107 ANAPHYLATOXIN-LIKE 2.
FT DOMAIN [illegible] [illegible] ANAPHYLATOXIN-LIKE 3.
FT DOMAIN 158 [illegible] EGF-LIKE 1.
FT DOMAIN [illegible] [illegible] EGF-LIKE 2, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 3, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 4, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 5, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 6, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 7, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 8, CALCIUM-BINDING (POTENTIAL).
FT DOMAIN [illegible] [illegible] EGF-LIKE 9, CALCIUM-BINDING (POTENTIAL).
FT DISULFID [illegible] [illegible] BY SIMILARITY.
FT DISULFID [illegible] [illegible] BY SIMILARITY.
FT DISULFID [illegible] [illegible] BY SIMILARITY.
FT DISULFID 72 [illegible] BY SIMILARITY.
FT DISULFID 85 104 BY SIMILARITY.
FT DISULFID 106 125 BY SIMILARITY.
FT DISULFID 107 138 BY SIMILARITY.
FT DISULFID 114 139 BY SIMILARITY.
FT DISULFID 162 171 BY SIMILARITY.
FT DISULFID 167 176 BY SIMILARITY.
FT DISULFID 178 191 BY SIMILARITY.
FT DISULFID 197 210 BY SIMILARITY.
FT DISULFID 204 219 BY SIMILARITY.
FT DISULFID 225 237 BY SIMILARITY.
FT DISULFID 243 256 BY SIMILARITY.
FT DISULFID 250 265 BY SIMILARITY.
FT DISULFID 271 283 BY SIMILARITY.
FT DISULFID 289 301 BY SIMILARITY.
FT DISULFID 317 330 BY SIMILARITY.
FT DISULFID 336 348 BY SIMILARITY.
FT DISULFID 343 357 BY SIMILARITY.
FT DISULFID 359 372 BY SIMILARITY.
FT DISULFID 378 390 BY SIMILARITY.
FT DISULFID 386 399 BY SIMILARITY.
FT DISULFID 401 414 BY SIMILARITY.
FT DISULFID 420 429 BY SIMILARITY.
FT DISULFID 440 454 BY SIMILARITY.
FT DISULFID 460 473 BY SIMILARITY.
FT DISULFID 469 482 BY SIMILARITY.
FT DISULFID 484 498 BY SIMILARITY.
FT DISULFID 504 517 BY SIMILARITY.
FT DISULFID 511 526 BY SIMILARITY.
FT DISULFID 531 553 BY SIMILARITY.
FT CARBOHYD 173 173 N-LINKED (GLCNAC...) (POTENTIAL).
FT VARSPLIC 542 681 RPRVDRADIIRCVKSCQHNDISCVLNPILSHSHTAISLPTF
FT REFNKPEEIVFLRSPTPTHLPHMDSPEIVYDILEGNIQNSF
FT DIIKRLDHGMIVGVVKQVRPLVGPVRTVLKLAMNYVTNGVV
FT SHRNIINVRIYVSEFWF -> RCERLSCNESNECMAFTRRI
FT TYYQLTFPAKIPVPTDLFRMGPSNTALGDDIEVAIVDGNRD
FT GFFAAKRLDHGGVLVLQKPIAWPQDFQIALEMKLKRFGHLS
FT IYLFKIRPVRHARRHQQRY (in isoform C).
FT /FTId=VSP_007379.
SQ SEQUENCE 681 AA; 74459 MW; 175C966305A46699 CRC64;



Query Match 26.2%; Score 629.5; DB 1; Length 681; 

Best Local Similarity 32.8%; Pred. No. 1.3e-39; 

Matches 151; Conservative 64; Mismatches 160; Indels 85; Gaps 21; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

II II I I hlhll - II II I I = =| : 
Db 270 QCAAGFIQD-ALGSCIDINECVSVTALSRG-QMCFNTVGSFICQRHS 314

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMD-ESNQCVDVDECATDSHQCNPTQIC 119

= I M - I Hlhllll = h I 
Db 315 VTCGRGYHLNAEGTRCVDIDECAGPDNSCD-GHGC 348

Qy 120 INTEGGYTCSCTDGYWL--LEGQCLDIDECRYGY CQQLCANVPGSYSCTCNPGFTL 173

ii i i i i i= i mill i i i h mi hi ii i 

Db 349 INLVGSYRCECRTGFIFNSISRSCEDIDECR-NYPGRLCAHKCENILGSYKCSCTAGFKL 407

Qy 174 NEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SE 230 

Hlhl Mill : : I I I I I llh I I : I | I II : 

Db 408 ADDGRNCDDVNECES-SPCSQGCANVYGSYQSYCRRGYQLSDADGITCEDIDECALPTGG 466

Qy 231 FLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKC 289 

= 1 = I I ll* = hll II I : lllllhll I :':h. II 

Db 467 HICSYRCHNTPGSFHCTCPASGYTLAANGRSCQDIDECLTGTHSCSESESCFNIQGGFRC 526 

Qy 290 IDPIRCEEPYLRISDN RCM--CPAENPGCRDQPFTILYRDMDVVS 332

I I I I lh I : I I II :| 

Db 527 LS-FDCPANYRRSGDTRPRVDRADIIRCVKSCQHNDISCVLNP--ILSHSHTAISLPTFR 583

Qy 333 GRSVPADIFQMQATT-RYPGAYYIFQIKSGNEGREFYMRQT GPISATLVMTRP 384

: I :| = = : I : : I II I : = II = II 

Db 584 EFNKPEEIVFLRSPTPTHLPHMDSPEIVYDILEGNIQNSFDIIKRLDHGMIVGVVKQVRP 643

Qy 385 IKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 423

= M Ml I I- I ::| :|||lh: I 

Db 644 LVGPVRTVLKLAMNYVTNGVVSHR--NIINVRIYVSEFWF 681



Search completed: January 9, 2004, 12:34:07
Job time : 11.713 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd.



OM protein - protein search, using sw model 



Run on: January 9, 2004, 12:32:07 ; Search time 32.5385 Seconds
(without alignments)
3354.684 Million cell updates/sec

Title: US-09-674-379A-14
Perfect score: 2407
Sequence: 1 QCTNGFDLDRQSGQCLDIDE ... INFRGSSVIRLRIYVSQYPF 423



Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5
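For orientation, the `sw` (Smith-Waterman) model with the gap parameters listed above can be sketched as follows. This is a minimal illustration and not GenCore's implementation: the function names, the toy match/mismatch scorer used in place of BLOSUM62, and the convention that a gap of length k costs Gapop + k x Gapext are all assumptions.

```python
def toy_score(x, y):
    """Hypothetical stand-in for a BLOSUM62 lookup: +5 match, -4 mismatch."""
    return 5.0 if x == y else -4.0


def smith_waterman_affine(a, b, score, gap_open=10.0, gap_ext=0.5):
    """Return the best local alignment score of a and b with affine gaps.

    Uses the Gotoh recurrences. Assumption: a gap of length k costs
    gap_open + k * gap_ext; the report does not state GenCore's convention.
    """
    neg = float("-inf")
    n, m = len(a), len(b)
    # H: best score of an alignment ending at (i, j)
    # E: best score ending with a gap in a; F: ending with a gap in b
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(E[i][j - 1] - gap_ext,
                          H[i][j - 1] - gap_open - gap_ext)
            F[i][j] = max(F[i - 1][j] - gap_ext,
                          H[i - 1][j] - gap_open - gap_ext)
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # Local alignment: scores never drop below zero
            H[i][j] = max(0.0, diag, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best
```

With a real substitution matrix, aligning the query against itself would give the "Perfect score" that the report uses as the denominator of "Query Match".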



Searched: 830525 seqs, 258052604 residues

Total number of hits satisfying chosen parameters: 830525

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : SPTREMBL_23:*
1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result          Query
   No.   Score  Match  Length  DB  ID       Description
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52 
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44 7 


4 


09H7n c ; 
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11 


ORBPBS 
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5 
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7ft 7 


11 
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11 


0922K8 
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7 


741 R 




. o 


fift R 

D O J 


11 
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8 
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U O J 
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Q t - A bnmn cani on 
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j v . 


4 


1174 


11 


099KRft 
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^ J LLILlo UIUqLUIU 


10 
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O 1MLJ LJ J 
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29 


. 6 
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13 


073774 
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12 
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29 , 


. 2 
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4 
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4 
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14 
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29 , 
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1231 


4 


Oft IUI 0 
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15 
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, 9 
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16 


674 . 5 


28 . 


. 0 
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4 


09HB05 




17 


629.5 
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. 2 


681 


13 


042182 


04 218 2 hrarhvdanin 


18 


587 
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698 


5 


09V4B8 


09v4bft dTOcjrvnhi 1^ 
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09V c Ift 9 




21 
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22 
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4 
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DQfiT*w7 hnmn Qs*r»"i on 
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SI ft S 
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7 ft 79 


1 1 
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09WT TWR 
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?4 


R1 7 R 
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j. . 
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A 


y -? o i\.o y 


yyoKoy noino Sapisn 


25 


516 . 5 


21 . 


. 5 


13 98 


13 


OftAXMfi 
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26 
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( 4 


1 7ft 9 

J- J O J? 


11 


Oft mi ft 

^ O V_VJ X o 


yOLy lu IILUb lllUbLUlU 


27 
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1713 


11 


Oftrm q 
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y^/O^yxj IILU.O lllUoLUl U 


28 
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21 . 


. 4 
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13 


0ftiTFZ4 

^UU 1 Ul 


C\ A ~i f 74 v"onmm i c 1 so 

yO J XZil j\-C1JLJ^J Llo X C* C 


29 
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2 1 . 


3 


3857 


11 


OR ft A4 0 

W O O O M w 


OARR4H mno mi i a pi 1 1 1 1 
UOOOtU LUU.O lllLXo U.X LI 


30 


512 


2 1 . 


3 


1713 


1 1 


Oft ft 74 9 


O ft ft "\ 4 Q mi i a mi i a m il n 


31 
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2 1 


> -j 




4 




flQ(\TT*\fl hnmn aann on 
Vj-?D jpo IKJIIICJ bcipxcll 


32 


510.5 


21 


2 


7ft 7 

/Of 


1 1 
X X 


OftKDfil 
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i^OJs.UDl lllUo lllUfaL.UxU 


33 


SOft 

J V o 


21 


1 


1 ft ft 
X o o 


1 1 

X X 


OftPI Tift 


O A v 1 1 1 A mi i q mi i o m i 1 1 1 


34 
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21 . 


, i 


70ft 


1 7 

X J 


Pft 7^67 


PA7"^fi7 rr^lliiQ nail 


35 


506 . 5 


21 . 


0 


*j J7 VJ D 


11 


D9WT TM9 
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36 
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11 


OftBNH7 
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37 
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5 


09RT..T1 

\s J O XJU X 
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38 
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20 . 


, 1 
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O?ft019 hn=? ^P5^^"r^^c^ 


39 
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20 , 


, 0 


1764 


11 


035806 

JU uu 


vj J o w o laLiua uui v 


    40   480.5   20.0     517   4  Q9NP01   Q9np01 homo sapien
    41     474   19.7     746   4  Q96HB9   Q96hb9 homo sapien
    42     474   19.7     893   6  Q8MJK0   Q8mjk0 cercopithec
    43     474   19.7    1256   4  Q9NS15   Q9ns15 homo sapien
    44     474   19.7    1382   4  Q9H7K2   Q9h7k2 homo sapien
    45   471.5   19.6    1095  11  Q60784   Q60784 mus musculu



ALIGNMENTS 



RESULT 1 
Q96TF5 

ID Q96TF5 PRELIMINARY; PRT; 443 AA.
AC Q96TF5;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Mutant p53 binding protein 1 (MBP1).
GN MBP1.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RA Tanaka S.;
RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RA Tanaka S., Sugimachi K., Sugimachi K.;
RT "Human mutant p53 binding protein (MBP1).";
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB030655; BAA92880.1; -.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001491; Thrmbomoduln.
DR Pfam; PF00008; EGF; 4.
DR PRINTS; PR00907; THRMBOMODULN.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW EGF-like domain.
SQ SEQUENCE 443 AA; 49421 MW; 9CE175F4F388A56D CRC64;



Query Match 53.0%; Score 1276.5; DB 4; Length 443; 

Best Local Similarity 52.0%; Pred. No. 2.2e-117; 

Matches 220; Conservative 67; Mismatches 119; Indels 17; Gaps 4; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

= = = I I I : iiiiihhi hi = i 

Db 38 ECTDGYEWDPDSQHCRDVNECLTI PEACKGEMKCINHYGGYLCLPRSAAVINDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI 120 

M I II II I Ih h : llllll I I I hi I 

Db 94 EGP-PPPVPPAQHPN PCPPGYEPDDQDSCVDVDECAQALHDCRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi III : M lIM III I |:|||: | | III I : III 

Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240

Mill II I I Ih" I I I I I II llhlllhl :|lh llh 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261 

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

II = 111 II II I lllhlll I i III I II II Mh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYI 320 

Qy 301 RI SDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KS 360 

= 1 = 111 = 111 II Ihll :h:| I : I lllllhlhllh Mill - llh: 
Db 321 QVSENRCLCPASNPLCREQPSSI VHRYMTI TSERSVPADVFQI QATSVYPGAYNAFQI RA 380 



Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420 

II W \ ■■ I I Ih Ih I I I I |||||:|:|:::::| II II I 
Db 381 GNSQGDFYIRQINNVSAMLVIARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440

Qy 421 YPF 423 

I I 

Db 441 YTF 443 
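
The percentages in the statistics block of RESULT 1 appear to follow from simple arithmetic on the other reported figures. The sketch below shows the relationships; they are inferred from the numbers printed in this report rather than taken from GenCore documentation, and the function names are hypothetical.

```python
def query_match_pct(score, perfect_score):
    """Query Match: alignment score as a percentage of the perfect score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Best Local Similarity: identical columns as a percentage of all
    alignment columns (matches + conservative + mismatches + indels)."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Figures reported for RESULT 1 (perfect score 2407)
print(query_match_pct(1276.5, 2407))
print(best_local_similarity_pct(220, 67, 119, 17))
```

The same two functions reproduce the percentages printed for the other results checked in this listing, for example RESULT 4 (score 1096.5; 201 matches out of 468 columns).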



RESULT 2 
Q9JM06 

ID Q9JM06 PRELIMINARY; PRT; 443 AA.
AC Q9JM06;
DT 01-OCT-2000 (TrEMBLrel. 15, Created)
DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE EGF-containing fibulin-like extracellular matrix protein 2.
GN EFEMP2.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=20435063; PubMed=10982184;
RA Katsanis N., Venable S., Smith J.R., Lupski J.R.;
RT "Isolation of a paralog of the Doyne honeycomb retinal dystrophy gene
RT from the multiple retinopathy critical region on 11q13.";
RL Hum. Genet. 106:66-72(2000).
DR EMBL; AF109122; AAF65189.1; -.
DR HSSP; P00736; 1APQ.
DR MGD; MGI:1891209; Efemp2.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001491; Thrmbomoduln.
DR Pfam; PF00008; EGF; 4.
DR PRINTS; PR00907; THRMBOMODULN.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 6.
KW EGF-like domain; Matrix protein.
SQ SEQUENCE 443 AA; 49452 MW; 5AEC2A91048B336A CRC64;

Query Match 52.9%; Score 1273.5; DB 11; Length 443; 

Best Local Similarity 52.0%; Pred. No. 4.3e-117; 

Matches 220; Conservative 65; Mismatches 121; Indels 17; Gaps 3; 

Qy 1 QCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

Hhh: I I I h:|| lllllhhl hi IIIIMh 
Db 38 ECTDGYEWDADSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPRSAAVINDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI 120 

I I I M : I I Ih II lllllll I I hi I 

Db 94 EGPPPPAA HAQQPNPCPQGYEPDEQESCVDVDECTQALHDCRPSQDCH 141 



Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi III = MMIIIMI Ml | I : I I I : I I III I : III 

Db 142 NLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240

Mill II I I hlhhllh | | | | || llhll I :|lh llh 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCNQGYELHRDGFSCSDIDECGYSSYLCQYRCVNE 261

Qy 241 PGTYFCSCPPGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYL 300 

II : I II II II I lllhlll I I Ml I IhMM II Mh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECETGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYV 320

Qy 301 RI SDNRCMCPAENPGCRDQPFTI LYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KS 360 

-llllhill II I h I ! :h:| I : I I I I I I I : I I : I I I : Mill Mhl 
Db 321 QVSDNRCLCPASNPLCREQPSSIVHRYMSITSERSVPADVFQIQATSVYPGAYNAFQIRS 380

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420

II ! Ml Ih I MM I I I I I : I : I : - : : I llhll 

Db 381 GNTQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440 

Qy 421 YPF 423 

Db 441 YTF 443 
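
Each result carries the same three-line statistics block ("Query Match ...", "Best Local Similarity ...", "Matches ..."). A minimal sketch of how such a block could be read into a dictionary is given below; the regular expression and the function name are illustrative assumptions, written against the cleanly printed lines of this report rather than any official GenCore format description.

```python
import re

# One "Name value;" field, e.g. "Query Match 52.9%;" or "Pred. No. 4.3e-117;"
_FIELD = re.compile(r"([A-Za-z][A-Za-z. ]*?)\s+(-?\d+(?:\.\d+)?(?:e-?\d+)?)%?;")


def parse_stats(lines):
    """Collect every 'Name value;' field from the given lines as floats."""
    stats = {}
    for line in lines:
        for name, value in _FIELD.findall(line):
            stats[name.strip()] = float(value)
    return stats


# Statistics block of RESULT 2 as printed in this report
result_2 = parse_stats([
    "Query Match 52.9%; Score 1273.5; DB 11; Length 443;",
    "Best Local Similarity 52.0%; Pred. No. 4.3e-117;",
    "Matches 220; Conservative 65; Mismatches 121; Indels 17; Gaps 3;",
])
print(result_2["Score"], result_2["Pred. No."])
```

Lines damaged by OCR (stray spaces inside numbers, a slash in place of a semicolon) would need cleaning before this pattern matches them.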

RESULT 3 
Q9H3D5 

ID Q9H3D5 PRELIMINARY; PRT; 443 AA.
AC Q9H3D5;
DT 01-MAR-2001 (TrEMBLrel. 16, Created)
DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Fibulin-like extracellular matrix protein.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RA Seibold S., Marx M.;
RT "Cloning of a new fibulin-like gene.";
RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF124486; AAG45245.1; -.
DR HSSP; P35555; 1EMN.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR001491; Thrmbomoduln.
DR Pfam; PF00008; EGF; 3.
DR PRINTS; PR00907; THRMBOMODULN.
DR SMART; SM00179; EGF_CA; 4.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 5.
KW EGF-like domain; Matrix protein.
SQ SEQUENCE 443 AA; 49535 MW; D91784BF36A8A060 CRC64;



Query Match 52.1%; Score 1253.5; DB 4; Length 443; 

Best Local Similarity 51.3%; Pred. No. 4e-115; 

Matches 217; Conservative 67; Mismatches 122; Indels 17; Gaps 4; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

: I h I - I I I I 111111 = 1^1 hi Mllhlh I 
Db 38 ECTDGYEWDPDSQHCRDVNECLTI PEACKGEMKCINHYGGYLCLPRSAAVTNDLHG 93 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI 120 

II I II II I lh i : MINIM I hi I 

Db 94 ---EGP-PPPVPPAQHPN PCPPGYEPDDQDSCVDVDECAQALHDRRPSQDCH 141 

Qy 121 NTEGGYTCSCTDGYWLLEGQCLDIDECRYGYCQQLCANVPGSYSCTCNPGFTLNEDGRSC 180 

I I I hi III = MMIIIMI III I h I | | Ml I : III 

Db 142 NLSGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCEPGFQLGPNNRSC 201 

Qy 181 QDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQ 240

Mill II I I i MM || llhlllhl I I 

Db 202 VDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSYLCQYRCVNE 261

Qy 241 PGTYFCSCPPGYI LLDDNRSCQDI NECEHRNHTCNLQQTCYNLQGGFKCI DPI RCEEPYL 300 

II : I II II II I lllhlll I h III I IhMM II Mh 

Db 262 PGRFSCHCPQGYQLL-ATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRCVEPYI 320 

Qy 301 RI SDNRCMCPAENPGCRDQPFTILYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KS 360 

■ I- I II I I I | : | | I I I : I | : | | | : Mill III- 

Db 321 QVSENRCLCPASNPLCREQPSS I VHRYMTI TSERMRPADVFQI QATS VYPGAYNAFQI RA 380 

Qy 361 GNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQ 420

II Ml lh lh MM I I I I I = h h : : : : I Mhll 

Db 381 GNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGA 440 

Qy 421 YPF 423 

I I 

Db 441 YTF 443 



RESULT 4 
Q8BPB5 

ID Q8BPB5 PRELIMINARY; PRT; 493 AA.
AC Q8BPB5;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Epidermal growth factor-containing fibulin-like extracellular matrix
DE protein 1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Xiphoid cartilage;
RX MEDLINE=22354683; PubMed=12466851;
RA The FANTOM Consortium,
RA the RIKEN Genome Exploration Research Group Phase I & II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
DR EMBL; AK077302; BAC36738.1; -.
SQ SEQUENCE 493 AA; 54952 MW; 9CEDC7BF2FF9430F CRC64;

Query Match 45.6%; Score 1096.5; DB 11; Length 493;

Best Local Similarity 42.9%; Pred. No. 1.4e-99; 

Matches 201; Conservative 69; Mismatches 151; Indels 47; Gaps 5; 

Qy 1 QCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

llhl- | || IMM :hlhl I III llllhhl 

Db 28 QCTDGYEWDP I RQQCKDI DECDI VPDACKGGMKCVNHYGGYLCLPKTAQI I VNNEHPQQE 87 

Qy 61 TPYS GPYPAAAPPLSAPNYPT 81 

II : | : |:| :: | | 

Db 88 TPAAEASSGATTGTVAARSMATSGWPGGGFMASATAVAGPEVQTGRNNFVIRRNPADPQ 147 

Qy 82 ISRPLICRFGYQMDESNQCVDVDECATDSHQCNPTQICINTEGGYTCSCTDGYWL 136 

I = I lh I I I hill I I hill I HI I II 

Db 148 RIPSNPSHRIQCAAGYEQSEHNVCQDIDECTSGTHNCRTDQVCINLRGSFTCQCLPGYQK 207 

Qy 137 LEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATENPCVQT 195 

Ihlllll II I I I Mh I hill I : U hill I I I 

Db 208 RGEQCVDIDECTVPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDASNQCAQQ 267 

Qy 196 CVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCPPGYILL 255 

I I llllhh I I I I I I hill I :|lh:||hll = I II II - 

Db 268 CYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYEW 327 

Qy 256 DDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMCPAENPG 315 

'MINIMI II = hi llhl h: h hllhll I 

Db 328 -RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDHYVLTSENRCVCPVSNTM 385 

Qy 316 CRDQPFTI LYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KSGNEGREFYMRQTGPI 375 

lh I :hh I : I MlhlllhlM I hllllll llhlll 

Db 386 CRELPQS I VYKYMS I RSDRS VPSDI FQI QATMI YANTI NTFR I KSGNENGEFYLRQTS PV 445 

Qy 376 SATLVMTRPIKGPREIQLDLEMITVNTVINFRGSSVIRLRIYVSQYPF 423 

II lh : = I I I I I' '! II I | | : | 

Db 446 SAMLVLVKSLSGPREYIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF 493

RESULT 5 
Q8K0J4 

ID Q8K0J4 PRELIMINARY; PRT; 387 AA.
AC Q8K0J4;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Hypothetical protein.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Colon;
RA Strausberg R.;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC031184; AAH31184.1;
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR006210; IEGF.
DR Pfam; PF00008; EGF; 3.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 5.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS01187; EGF_CA; 5.
KW Hypothetical protein.
SQ SEQUENCE 387 AA; 43334 MW; EE88DC9D1422C1C7 CRC64;

Query Match 39.9%; Score 960.5; DB 11; Length 387; 

Best Local Similarity 48.3%; Pred. No. 2.7e-86; 

Matches 171; Conservative 56; Mismatches 122; Indels 5; Gaps 4; 

Qy 71 APPLSAPNYPTI SRPLI CRFGYQMDESNQCVDVDECATDSHQCNPTQI CI NTEGGYTCSC 130 

II l : l I : I I 1= III hill : H I hill I HI I 

Db 38 ADPQRI PSNP - - SHRI QCAAGYEQSEHNVCQDI DECTSGTHNCRTDQVCINLRGSFTCQC 95 

Qy 131 TDGYWLLEGQCLDIDECRY-GYCQQLCANVPGSYSCTCNPGFTLNEDGRSCQDVNECATE 189 

II IhlMM II I I I Mh | hl|| | : :| III 

Db 96 LPGYQKRGEQCVDIDECTVPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDAS 155 

Qy 190 NPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECSFSEFLCQHECVNQPGTYFCSCP 249

I I I I I I I I I I I = = l hill I :Mh:|lhll : I II 

Db 156 NQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCP 215 

Qy 250 PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDPIRCEEPYLRISDNRCMC 309

II == :hllllllll II : hi Mhl h: h hllhl 

Db 216 QGYEW-RSRTCQDINECETTNE-CREDEMCWNYHGGFRCYPRNPCQDHYVLTSENRCVC 273 

Qy 310 PAENPGCRDQPFTI LYRDMDWSGRSVPADI FQMQATTRYPGAYYI FQI KSGNEGREFYM 369 

I I lh I :hh I : I MlhlllhlM I hllllll Mh 

Db 274 PVSNTMCRELPQS I VYKYMS I RSDRSVPSDI FQI QATMI YANTINTFRI KSGNENGEFYL 333 

Qy 370 RQTGP I SATLVMTRP I KGPREI QLDLEMI TVNTVI NFRGSSVI RLRI YVSQYPF 423 

III hi I lh .: : MM II: II llhll I I = I 

Db 334 RQTSPVSAMLVLVKSLSGPREYI VDLEMLTVSS I GTFRTSSVLRLTI I VGPFSF 387 



RESULT 6 
Q922K8 

ID Q922K8 PRELIMINARY; PRT; 685 AA.
AC Q922K8;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Similar to fibulin 1.
GN FBLN1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Urinary bladder;
RX MEDLINE=22354683; PubMed=12466851;
RA The FANTOM Consortium,
RA the RIKEN Genome Exploration Research Group Phase I & II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
DR EMBL; BC007140; AAH07140.1; -.
DR EMBL; AK035388; BAC29054.1; -.
DR MGD; MGI:95487; Fbln1.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 3.
DR Pfam; PF00008; EGF; 6.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00179; EGF_CA; 8.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 8.
KW EGF-like domain.
SQ SEQUENCE 685 AA; 75283 MW; EF0D77D7F66B73B8 CRC64;

Query Match 30.8%; Score 741.5; DB 11; Length 685;
Best Local Similarity 36.5%; Pred. No. 2.1e-64;
Matches 158; Conservative 64; Mismatches 152; Indels 59; Gaps 16;

Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

I! I I hlhll :| I hi III 
Db 295 QCKSGF I QD -ALGNC I DI NECLS I SAPCPVGQTCI NTEGS YTC 336 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQM-DESNQCVDVDECATDSHQCNPTQI C 119 

! I II : =| :||Mllh = I I 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECS PPAEPCGKGHHC 375 

Qy 120 INTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCNPGFTLN 174

= 1= I = I I l = : = 1 = 11 = 11 II III llh 1=1= II 1 = 

Db 376 LNSPGSFRCECKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFHCSCSAGFRLS 435 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFI CRCDPGYELEE - DGVHCSDMDECSF - - SEF 231 

I MINI MM llh | | I : Ml | |:|||: 

Db 436 VDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 494

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

: I = 1 = 1 lh= I I I I II I = 1 = 1111 = 11 I h= :|| = l = ll hh 

Db 495 ICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGSFRCL 554 



Qy 291 DPI RCEEPYLRISDNRC-MCPA-ENPGCRDQPFTI LYRDMDWSGRSVPADI FQMQATTR 348 

M I I =1 II I II I III: : Ml :hl 
Db 555 S - FECPENYRRSADTRCERLPCHENQECPRLPLRI TYYHLSFPTNI QVPAWFRMGPSSA 613 

Qy 349 YPGAYYI FQI KSGNEGREFYMRQTGP I SATLVMTRPI KGPREI QLDLEM- - I TVNTVINF 406 

II I Hll I h I : -I'M ||:: I I II I 

Db 614 VPGDSMQLAITAGNEEGFFTTRKVSHHSGWALTKPIPEPRDLLLTVKMDLYRHGTVSSF 673 

Qy 407 RGSSVIRLRIYVS 419 

I :| hll 
Db 674 ----VAKLFIFVS 682



RESULT 7 
Q8C3B1 



ID Q8C3B1 PRELIMINARY; PRT; 685 AA.
AC Q8C3B1;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Fibulin 1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6J; TISSUE=Head;
RX MEDLINE=22354683; PubMed=12466851;
RA The FANTOM Consortium,
RA the RIKEN Genome Exploration Research Group Phase I & II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
DR EMBL; AK086451; BAC39669.1; -.
SQ SEQUENCE 685 AA; 75282 MW; E38377D35B08C560 CRC64;

Query Match 30.8%; Score 741.5; DB 11; Length 685;
Best Local Similarity 36.5%; Pred. No. 2.1e-64;
Matches 158; Conservative 64; Mismatches 152; Indels 59; Gaps 16;



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60 

M HI I I hlhll =| | hi Ml 
Db 295 QCKSGFIQD-ALGNCIDINECLSISAPCPVGQTCINTEGSYTC 336 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 

: II I II = =| :|IMIIh : J I 

Db 337 QKNVPN CGRGYHLNEEGTRCVDVDECSPPAEPCGKGHHC 375

Qy 120 INTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYSCTCNPGFTLN 174 

: I = I : I I I- : hlhll II III Mh hh II h 

Db 376 LNSPGSFRCKCKAGFYFDGISRTCVDINECQRYPGRLCGHKCENTPGSFHCSCSAGFRLS 435 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFI CRCDPGYELEE-DGVHCSDMDECSF- -SEF 231 

llllhlllll Mil Mh | | I : Ml | |:|||: 

Db 436 VDGRSCEDVNEC-LNSPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 494



Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290

I : hi lh: MM I I : MMMMI I I MhhM I 

Db 495 I CS YRCINI PGSFQCSCPSSGYRLAPNGRNCQDI DECVTGI HNCS INETCFNI QGSFRCL 554 

Qy 291 DPI RCEEP YLR I SDNRC -MCPA - ENPGCRDQPFTI LYRDMD WSGRS VPAD I FQMQATTR 348 

I I I I M M I M I III: : III MM 
Db 555 S-FECPENYRRSADTRCERLPCHENQECPRLPLRITYYHLSFPTNIQVPAWFRMGPSSA 613 

Qy 349 YPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM--ITVNTVINF 406

II I MM I h I : MMI M I I M M 

Db 614 VPGDSMQLAITAGNEEGFFTTRKVSHHSGWALTKPIPEPRDLLLTVKMDLYRHGTVSSF 673 

Qy 407 RGSSVIRLRIYVS 419

I M IMI 

Db 674 VAKLFIFVS 682 



RESULT 8 
Q8TBH8 

ID Q8TBH8 PRELIMINARY; PRT; 683 AA.
AC Q8TBH8;
DT 01-JUN-2002 (TrEMBLrel. 21, Created)
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Fibulin 1.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Brain;
RA Strausberg R.;
RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC022497; AAH22497.1; -.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 3.
DR Pfam; PF00008; EGF; 6.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00179; EGF_CA; 8.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 8.
KW EGF-like domain.
SQ SEQUENCE 683 AA; 74423 MW; 2665A3961B6403B4 CRC64;

Query Match 30.6%; Score 735.5; DB 4; Length 683;
Best Local Similarity 35.7%; Pred. No. 8.1e-64;
Matches 155; Conservative 66; Mismatches 152; Indels 61; Gaps 16;
Qy 1 QCTNGFDLDRQSGQCLDI DECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

II Ml I I IMIMI Ml IM I I I 



Db 293 QCKSGFIQD-ALGNCIDINECLSISAPCPIGHTCINTEGSYTC 334



Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 

I I M : :| :|||||||| : I I 

Db 335 QKNVPN CGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRC 373 

Qy 120 INTEGGYTCSCTDGYWL- -LEGQCLDIDEC-RY- -GYCQQLCANVPGSYSCTCNPGFTLN 174 

hll I : |:|::|| || I II |:|: || | 

Db 374 VNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYLCSCSVGFRLS 433 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFI CRCDPGYELEE - DGVHCSDMDECSF - - SEF 231 

MMI I :|| I | | |||: | | ||:| : II | || 
Db 434 VDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 492

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

= 1 : hi ll = = WW II I : i I h: ! T, ■ : 

Db 493 ICSYRCINIPGSFQCSCPSSGYRLAPNGSNCQDIDECVTGIHNCSINETCFNIQGGFRCL 552 

Qy 291 DPI RCEEPYLR I SDNRC -MCPA - ENPGCRDQPFTI LYRDMDWSGRS VPAD I FQMQATTR 348 

I I I I : II I II I I I I : : II I :: 

Db 553 -AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPAWFRMGPSSA 611 

Qy 349 YPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEMITVNTVINFRG 408

II I III I h I I : :|=|= II- -II - I 

Db 612 VPGDSMQLAITGGNEEGFFTTRKVSPHSGWALTKPVPEPRDL LLTVKMDLSRHG 666 

Qy 409 ---SSVIRLRIYVS 419 

I I i hll 
Db 667 TVSSFVAKLFIFVS 680 

RESULT 9 
Q99K58 

ID Q99K58 PRELIMINARY; PRT; 1174 AA.
AC Q99K58;
DT 01-JUN-2001 (TrEMBLrel. 17, Created)
DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Similar to fibulin 2.
GN FBLN2.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC005443; AAH05443.1; -.
DR HSSP; P00736; 1APQ.
DR MGD; MGI:95488; Fbln2.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR Pfam; PF01821; ANATO; 2.
DR Pfam; PF00008; EGF; 6.
DR SMART; SM00104; ANATO; 3.
DR SMART; SM00179; EGF_CA; 9.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 3.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 3.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS01187; EGF_CA; 9.
KW EGF-like domain.
SQ SEQUENCE 1174 AA; 126460 MW; 8D628AC710FBA6B8 CRC64;

Query Match 30.4%; Score 731.5; DB 11; Length 1174; 

Best Local Similarity 35.3%; Pred. No. 3.8e-63; 

Matches 145; Conservative 60; Mismatches 147; Indels 59; Gaps 11; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTI PEACRGDMMCVNQNGGYLCI PRTNPVYRGPYSNPYS 60 

= 1 HI I I hlhll III hi III 
Db 785 RCMDGF -LQDPEGNCVDI NECTSLLEPCRSGFSCI NTVGS YTC 826 

Qy 61 TPYSGPYPAAAPPLSAPNYPTI SRPLI CRFGYQM - DESNQCVD VDECATDSHQCNPTQI C 119 

Ihl II U -Nihil I hi hi 
Db 827 QRN PLVCGRG YHANEEGS E C VDVNE CETGVHRCGEGQLC 865 

Qy 120 I NTEGGYTCSCTDG YW - - LLEGQCLD I DECRYG YCQQLCANVPGSYSCTCNPGFTLN 174 

I I I I I h ! II I I 1 1 1 1 hi II I 

Db 866 YNLPGSYRCDCKPGFQRDAFGRTCIDVNECWVSPGRLCQHTCENTPGSYRCSCAAGFLLA 925 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233 

lh hlllll I I I I I I I I Ihl III I = I : I I I : : || 
Db 926 ADGKHCEDVNECETRR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 984 

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

III Ihl hll II - : llhl-ll I h Hhhll hh 

Db 985 TFRCVNVPGSYQCACPEQGYTMMANGRSCKDLDECALGTHNCSEAETCHNIQGSFRCL-R 1043 

Qy 293 I RCEEPYLRI SDNRCMCPAENPGCRD QPFTILYRDMDWSGRSVPADIFQMQAT 346 

I hhh :| I hi I I : - :| Ml II- 

Db 1044 FDCPPNYVRVSETKC ERTTCQDITECQTSPARITHYQLNFQTGLLVPAHI FRIGPA 1099 

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397 

= I I III I 1= : : : | : ||: ||:|| 

Db 1100 PAFAGDTI SLTI TKGNEEGYFVTRRLNAYTGWSLQRSVLEPRDFALDVEM 1150 

RESULT 10 
Q8MJJ9 

ID Q8MJJ9 PRELIMINARY; PRT; 598 AA.
AC Q8MJJ9;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Fibulin-1c (Fragment).
OS Cercopithecus aethiops (Green monkey) (Grivet).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;
OC Cercopithecinae; Cercopithecus.
OX NCBI_TaxID=9534;
RN [1]
RP SEQUENCE FROM N.A.
RA Brooke J.S., Cha J.-H., Eidels L.;
RT "Cloning of monkey fibulin-1c gene.";
RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF395659; AAM90567.1; -.
DR InterPro; IPR000020; Anaphylatoxin.
DR InterPro; IPR000152; Asx_hydroxyl.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR006210; IEGF.
DR Pfam; PF01821; ANATO; 1.
DR Pfam; PF00008; EGF; 3.
DR SMART; SM00104; ANATO; 1.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 9.
DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.
DR PROSITE; PS01178; ANAPHYLATOXIN_2; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS01187; EGF_CA; 7.
FT NON_TER 1 1
SQ SEQUENCE 598 AA; 65516 MW; 849BF018DF452B02 CRC64;



Query Match 30.0%; Score 721; DB 6; Length 598; 

Best Local Similarity 35.3%; Pred. No. 1.9e-62; 

Matches 155; Conservative 72; Mismatches 184; Indels 28; Gaps 15; 



Qy 2 CTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPR---TNPVYRGPYSNP 58

I h:| : I 111 = 1 : I I H I I = I h I : :| 

Db 164 CGTGYEL-TEDNSCKDIDQCESGIHNCLPDFICQNTLGSFRCRPKLQCKNGFIQDALANC 222

Qy 59 YS-----TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDESNQCVDVDECATDSHQC 113

'III " I I II -I Ihlll = I 

Db 223 IDINECLSIVSAPCPTGHTCINTEGSYTQKNVPNCGRGYHLNEEGTRCDVNECAPPAEPC 282 

Qy 114 NPTQICINTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCN 168

l = h -llh : hh:|| II III III hh 

Db 283 GKGHRCVNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYVCSCS 342 

Qy 169 PGFTLNEDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECS 227

II ! Illlhhllh: :|| I I I llh I I I : ||| | I II- 

Db 343 VGFRLSVDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECA 401

Qy 228 F--SEFLCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQ 284

: I ' hi lh= I I I I II I = hlllhll I I- :|hhl 

Db 402 LPTGGHICSYRCINIPGSFQCSCPASGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQ 461 

Qy 285 GGFKCIDPIRCEEPYLRISDNRC-MCPA-ENPGCRDQPFTILYRDMDVVSGRSVPADIFQ 342

Mhh I I II : II INI |||: : || :|: 

Db 462 GGFRCL-AFECPENYRRSAATRCERLPCHENRECSKLPLRITYYHLSFPTNIQAPAVVFR 520

Qy 343 MQATTRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM--ITV 400

I II I III 1:1 :hh II- I I 

Db 521 MGPSSAVPGDSMQLAITGGNEEGFFTTRKVSPHSGVVALTKPVPEPRDLLLTVKMDLYRH 580

Qy 401 NTVINFRGSSVIRLRIYVS 419

II M I :| hll 



Db 581 GTVSSF----VAKLFIFVS 595



RESULT 11 
O73774

ID O73774 PRELIMINARY; PRT; 704 AA.

AC O73774;

DT 01-AUG-1998 (TrEMBLrel. 07, Created)

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin-1, isoform D.

GN FBLN1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99120531; PubMed=9923656;

RA Barth J.L., Argraves K.M., Roark E.F., Little C.D., Argraves W.S.;

RT "Identification of chicken and C. elegans fibulin-1 homologs and 

RT characterization of the C. elegans fibulin-1 gene."; 

RL Matrix Biol. 17:635-646(1998). 

DR EMBL; AF051399; AAC05387.1; -.

DR HSSP; P00742; 1HCG.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like. 

DR Pfam; PF01821; ANATO; 2. 

DR Pfam; PF00008; EGF; 6. 

DR SMART; SM00104; ANATO; 3. 

DR SMART; SM00179; EGF_CA; 8. 

DR PROSITE; PS01177; ANAPHYLATOXIN_1; 1.

DR PROSITE; PS01178; ANAPHYLATOXIN_2; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 5.

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 8.

KW EGF-like domain.

SQ SEQUENCE 704 AA; 78137 MW; D47D5A30D5E42932 CRC64;

Query Match 29.6%; Score 713; DB 13; Length 704;

Best Local Similarity 35.1%; Pred. No. 1.4e-61; 

Matches 160; Conservative 64; Mismatches 154; Indels 78; Gaps 21; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

II III I I I! : I i I I 

Db 294 QCMNGFIQD-ALGNCIDINECLSTNMPCPAGQICINTDGSYTC-QRISP 340

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQMDE-SNQCVDVDECATDSHQCNPTQIC 119

I II ::| :|||||||:: I :| 

Db 341 SCGRGYHLNEDGTRCVDVDECSSSDQPCGEGHVC 374 

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRY---GYCQQLCANVPGSYSCTCNPGFTLN 174

M I I I I II :: hi |:||| | I MM Ml I II h 

Db 375 INGPGNYRCECKSGYSFDVISRTCIDINECRRYPGRLCAHKCENTPGSYYCTCTMGFKLS 434 



Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231

Mllhhlll : :|| II I llh I I I : lh I 1 = 111 = 

Db 435 SDGRSCEDLNECES-SPCSQECANVYGSYQCYCRRGFQLSDIDGISCEDIDECALPTGGH 493

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

I hi lh: hll II I : hlllhll I h :|hhlllhh 

Db 494 ICSFRCINIPGSFQCTCPSTGYRLAPNARNCQDIDECVAETHNCSFNETCFNIQGGFRCL 553

Qy 291 DPIRCEEPYLRISDN---------RCM--CPAENPGC-RDQPFTILYRDMDVVSGRSV-- 336

Db 554 S-LECPENYRKSGDTVRLEKTDTIRCIKSCRPNDVNCVLDPVHTISHTVISLPTFREFTR 612

Qy 337 PADIFQMQA-TTRYPG--AYYIFQIKSGNEGREF-----YMRQTGPISATLVMTRPIKGP 388

I I :H I II I Ml II I II I : Ml II 

Db 613 PEEIIFLRAITPTYPANQADIIFDITEGNLRESFDIIKRYM--DGMTVGVVRQVRPIVGP 670

Qy 389 REIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 423

I III : I- I :- = hlhl I 
Db 671 FHAILKLEMNYVMGGVVSHR--NIVNVHIFVSEYWF 704



RESULT 12 
Q9Y3V7 

ID Q9Y3V7 PRELIMINARY; PRT; 576 AA. 

AC Q9Y3V7; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created)

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Hypothetical protein (Fragment).

GN DKFZP586A1519.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Uterus;

RA Wambutt R., Heubner D., Mewes H.W., Gassenhuber J., Wiemann S.;

RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AL050095; CAB43267.1; -.

DR HSSP; P00736; 1APQ.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR Pfam; PF00008; EGF; 6.

DR SMART; SM00179; EGF_CA; 8.

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 4.

DR PROSITE; PS01187; EGF_CA; 9.

KW Hypothetical protein; EGF-like domain.

FT NON_TER 1 1

SQ SEQUENCE 576 AA; 63274 MW; FF2F0E9B185D8AC3 CRC64;

Query Match 29.2%; Score 703.5; DB 4; Length 576;

Best Local Similarity 34.5%; Pred. No. 9.5e-61; 

Matches 142; Conservative 58; Mismatches 152; Indels 59; Gaps 11; 



Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

I I I | hlhll III hi III

Db 187

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVDECATDSHQCNPTQIC 119

II I I II I Mllh II I hi hi

Db 229

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANVPGSYSCTCNPGFTLN 174

Mil h hh:|| II I I Ml hi || |

Db 268

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

: hlllll : I I I I Mh I I 11 = 1 III ! : II

Db 328 ...KRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 386

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

hi I!': | hi | || : : Mhh:M I h Hhhll hh

Db 387 ...CLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGSFRCL-R 445

Qy 293 IRCEEPYLRISDNRCMCPAENPGCRD------QPFTILYRDMDVVSGRSVPADIFQMQAT 346

h = = l :| III I I : :: :| III ||::

Db 446 ...PPNYVQVSKTKC----ERTTCHDFLECQNSPARITHYQLNFQTGLLVPAHIFRIGPA 501

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397

I I III I h : : : | : ||: Ihll

Db 502



RESULT 13 
Q8IUI1 

ID Q8IUI1 PRELIMINARY; PRT; 1231 AA. 

AC Q8IUI1; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin 2.

GN FBLN2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RA Li D., Marian A.J., Roberts R.;

RT "Identification of a novel alternatively spliced isoform of human

RT fibulin-2 gene abundantly expressed in heart and genetic evaluation in

RT patients with ARVD.";

RL (In) Unknown A. (eds.); 

RL ABSTRACTS OF THE 52ND ANNUAL MEETING OF THE AMERICAN SOCIETY OF HUMAN 

RL GENETICS, pp. 323-323, Unknown Publisher (2002). 

DR EMBL; AY130458; AAN05435.1; -. 

DR EMBL; AY130456; AAN05435.1; JOINED. 

DR EMBL; AY130457; AAN05435.1; JOINED. 

SQ SEQUENCE 1231 AA; 131853 MW; 0A75D1C27F258D48 CRC64 ; 



Query Match 29.2%; Score 702.5; DB 4; Length 1231; 

Best Local Similarity 34.5%; Pred. No. 2.9e-60; 

Matches 142; Conservative 58; Mismatches 152; Indels 59; Gaps 11; 

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60 

H M I I hlhll I I I hi III 
Db 842 RCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC 883 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVDECATDSHQCNPTQIC 119

I I I I II h Hllhll I hi hi 
Db 884 QRNPLICARGYHASDDGXKCVDVNECETGVHRCGEGQVC 922

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANVPGSYSCTCNPGFTLN 174

I I I I I h I II I I III hi II I 

Db 923 HNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENTLGSYRCSCASGFLLA 982 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

11= Ml = I I I I Mh | | M l Ml | : | : | | |: = II 
Db 983 ADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 1041

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

hi Ihl hll M : : I I I = I = = I I I h =11 = 1=1! hh 

Db 1042 TFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGSFRCL-R 1100 

Qy 293 IRCEEPYLRISDNRCMCPAENPGCRD------QPFTILYRDMDVVSGRSVPADIFQMQAT 346

I h = = l H III I I : H III II- 

Db 1101 FECPPNYVQVSKTKC----ERTTCHDFLECQNSPARITHYQLNFQTGLLVPAHIFRIGPA 1156

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397

= 1 I III I h : : : | : ||: h| 

Db 1157 PAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDFALDVEM 1207



RESULT 14 
Q8IUI0 

ID Q8IUI0 PRELIMINARY; PRT; 1231 AA. 

AC Q8IUI0; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Fibulin 2.

GN FBLN2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RA Li D., Marian A.J., Roberts R.;

RT "Identification of a novel alternatively spliced isoform of human

RT fibulin-2 gene abundantly expressed in heart and genetic evaluation in

RT patients with ARVD.";

RL (In) Unknown A. (eds.);

RL ABSTRACTS OF THE 52ND ANNUAL MEETING OF THE AMERICAN SOCIETY OF HUMAN

RL GENETICS, pp. 323-323, Unknown Publisher (2002). 

DR EMBL; AY130459; AAN05436.1; -. 

SQ SEQUENCE 1231 AA; 131790 MW; 95D69EB2082952A7 CRC64 ; 



Query Match 29.1%; Score 701.5; DB 4; Length 1231; 

Best Local Similarity 34.5%; Pred. No. 3.6e-60; 

Matches 142; Conservative 58; Mismatches 152; Indels 59; Gaps 11;

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

I :|| I | hlhll :: I hi III 
Db 842 RCMDGF-LQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTC 883 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQ-MDESNQCVDVDECATDSHQCNPTQIC 119 

I I I I II h :||lhll I hi hi 
Db 884 QRNPLICARGYHASDDGAKCVDVNECETGVHRCGEGQVC 922

Qy 120 INTEGGYTCSCTDGYW--LLEGQCLDIDECRYG---YCQQLCANVPGSYSCTCNPGFTLN 174

I I I I I h l II I I III hi II I 

Db 923 HNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHTCENTLGSYRCSCASGFLLA 982 

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEEDGVHCSDMDECS-FSEFLC 233

Ih hlllll II llh I I Ihl III hhllh : II 

Db 983 ADGKRCEDVNECEAQR-CSQECANIYGSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILC 1041

Qy 234 QHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCIDP 292

hi Ihl hll II = : llhh:|| I h :|hhll hh 

Db 1042 TFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGSFRCL-R 1100

Qy 293 IRCEEPYLRISDNRCMCPAENPGCRD------QPFTILYRDMDVVSGRSVPADIFQMQAT 346

I h = = l U III I I : ■ ■ - I III lh = 

Db 1101 FECPPNYVQVSKTKC----ERTTCHDFLECQNSPARITHYQLNFQTGLLVPAHIFRIGPA 1156

Qy 347 TRYPGAYYIFQIKSGNEGREFYMRQTGPISATLVMTRPIKGPREIQLDLEM 397 

= 1 I III I h : : : | : ||: Ihll 

Db 1157 PAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQRAVLEPRDFALDVEM 1207



RESULT 15 
Q8NBH6 

ID Q8NBH6 PRELIMINARY; PRT; 638 AA. 

AC Q8NBH6;

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE Hypothetical protein NT2RP3003649.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RA Ota T., Nishikawa T., Suzuki Y., Kawai-Hio Y., Hayashi K., Ishii S.,

RA Saito K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y.,

RA Nagahari K., Sugano S., Isogai T.;

RT "HRI human cDNA sequencing project.";

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AK075566; BAC11705.1; -.

DR InterPro; IPR000020; Anaphylatoxin.

DR InterPro; IPR000152; Asx_hydroxyl.

DR InterPro; IPR001881; EGF_Ca.



DR InterPro; IPR006209; EGF_like. 

DR InterPro; IPR006210; IEGF. 

DR Pfam; PF01821; ANATO; 1. 

DR Pfam; PF00008; EGF; 4. 

DR SMART; SM00181; EGF; 9. 

DR SMART; SM00179; EGF_CA; 9. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4. 

DR PROSITE; PS01186; EGF_2; 3.

DR PROSITE; PS01187; EGF_CA; 8.

KW Hypothetical protein.

SQ SEQUENCE 638 AA; 70577 MW; EBC0DE3147A7621F CRC64;

Query Match 28.9%; Score 695; DB 4; Length 638;

Best Local Similarity 34.5%; Pred. No. 7.4e-60;

Matches 159; Conservative 65; Mismatches 149; Indels 88; Gaps 21;

Qy 1 QCTNGFDLDRQSGQCLDIDECRTIPEACRGDMMCVNQNGGYLCIPRTNPVYRGPYSNPYS 60

II = 11 I I MMI : | | hi Ml 
Db 228 QCKSGFIQD-ALGNCIDINECLSISAPCPIGHTCINTEGSYTC 269 

Qy 61 TPYSGPYPAAAPPLSAPNYPTISRPLICRFGYQM-DESNQCVDVDECATDSHQCNPTQIC 119 

: II I II : =| I 'I : : I I 

Db 270 QKNVPNCGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRC 308

Qy 120 INTEGGYTCSCTDGYWL--LEGQCLDIDEC-RY--GYCQQLCANVPGSYSCTCNPGFTLN 174

\ ■ \ \ lh : |:|::|| || | I ||| |:|: || |: 

Db 309 VNSPGSFRCECKTGYYFDGISRMCVDVNECQRYPGRLCGHKCENTLGSYLCSCSVGFRLS 368

Qy 175 EDGRSCQDVNECATENPCVQTCVNTYGSFICRCDPGYELEE-DGVHCSDMDECSF--SEF 231

llllhhllh: :|| I I I llh I I Ihl : III | hi I 

Db 369 VDGRSCEDINECSS-SPCSQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGH 427 

Qy 232 LCQHECVNQPGTYFCSCP-PGYILLDDNRSCQDINECEHRNHTCNLQQTCYNLQGGFKCI 290 

: I = hi II- I I I I II I = hlllhll I I- :||:|:|llh|: 

Db 428 ICSYRCINIPGSFQCSCPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGGFRCL 487

Qy 291 DPIRCEEPYLRISDN RCMCPAENPGCRDQPFTI LYRDMDWSGRSV 336 

MM: lh II | : : : : I' : 

Db 488 -AFECPENYRRSAATLQQEKTDTVRCIKSCRPNDVTCVFDPVHTISHTVISLPTF 541

Qy 337 PAD I FQMQA-TTRYPG- -AYYI FQI KSGNEGREF YMRQTGP I SATLVMTR 383 

I =1 : = l I :| I II I II I M I : | 

Db 542 REFTRPEEIIFLRAITPPHPASQANIIFDITEGNLRDSFDIIKRYM--DGMTVGVVRQVR 599

Qy 384 PIKGPREIQLDLEM-ITVNTVINFRGSSVIRLRIYVSQYPF 423

II II I III I h = I :h : |:||:| 

Db 600 PIVGPFHAVLKLEMNYVVGGVVSHR--NVVNVHIFVSEYWF 638



Search completed: January 9, 2004, 12:36:57 
Job time : 33.5385 secs



